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The invention provides a transgenic animal having within its genome a transgene construct for gastrointestinal tract specific expression 
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sequence including a salivary gland protein promoter/enhancer. Also provided are methods of expressing and producing proteins using such 
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TRANSGENIC ANIMALS EXPRESSING 
SALIVARY PROTEINS 

FIELD OF THE INVENTION 
5 The present invention relates to transgenic animals and, more specifically, to animals 

genetically modified to express a desired protein. 

BACKGROUND OF THE INVENTION 

Phosphorus is an essential element for the growth of all organisms. In livestock 
10 production, phosphorus deficiency has been described as the most prevalent mineral 
deficiency throughout the world and feed must often be supplemented with inorganic 
phosphorus in order to obtain desired growth performance of monogastric animals (e.g. pigs, 
poultry etc.). 

Phytic acid, or phytate, (myo-inositol 1,2,3, 4,5, 6-hexakis dihydrogen phosphate) is a 

15 major storage form of phosphorus in cereals and legumes, representing 18% to 88% of the 
total phosphorus content (Reddy et al. 1982). The enzyme phytase (/wyo-inositol 
hexakisphosphate phosphohydrolase) belongs to the group of phosphoric monoester 
hydrolases: it catalyzes the hydrolysis of phytate (myo-inositol hexakis phosphate) to 
inorganic monophosphate and lower phosphoric esters of myo-inositol or, in some cases, free 

20 /Tiyo-inositol. Phytases are classified either as 3-phytases or 6-phytases based on the first 
phosphate group attacked by the enzyme. 3 -phytase is typical for microorganisms and 6- 
phytase for plants (Cosgrove, 1980). 

Phytase is either absent or present at a very low levels in monogastric animals (Bitar 
and Reinhold 1972; Iqbal et al 1994). Consequently, dietary phytate is not digested or 

25 absorbed from the small intestine and instead is concentrated in fecal material, thereby 

contributing to phosphorus pollution in areas of intensive livestock production. Runoff fi'om 
animal farms leads to contamination of rivers and streams. Such runoff has resulted in rapid 
drops in the oxygen concentration in rivers and streams due to excessive algal growth in 
water, which, in turn, has led to an increase in the mortality rate of fish and existing flora and 

30 fauna. This is becoming a global problem as pig and poultry production is increased (Miner 
1999;Mallin 2000). Furthermore, phytic acid is viewed as an anti-nutritional factor because it 
interacts with essential dietary minerals and proteins limiting the nutritional values of cereals 
and legxmies in man and animals (Harland and Morris 1995). 
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For the above reasons, various attempts have been made to enable animals to utilize 
available phytate in feed. Such attempts have included production of low phytate plants 
(Abelson 1999), addition of phytase to the animal feed (Simons et al 1990) (Stahl et al. 
1999) or transformation of the fodder plants to produce the required phytase (Pen et al 1993, 
5 Verwoerd et al. 1995). A combination of these options, the feeding of phytase to poultry 
receiving low phytate com has also been tested (Huff et al 1998). However, these solutions 
increase the cost of animal production- Also because phytase is an enzyme, it is susceptible 
to inactivation by heat and moisture and is generally imstable at the high temperatures used 
for feed pelleting. 

10 The primary phytase used for supplementing animal feeds is from Asperigillus sp,; 

however, phytases are produced by a large number of plants and microorganisms (Wodzinski 
and Ullah 1996) (Dvorakova 1998). A phytase produced by Escherichia coli has been 
reported to exhibit the highest activity of those reported (Wodzinski and Ullah 1996). This 
phytase from E. coli was initially cloned as an acid phosphatase gene that was designated 

15 APPA (Dassa et al 1990). Greiner et al. (1991; 1993) purified phytase from E, coli and 
reported that some of the kinetic properties of the acid phosphatase activity of the native 
phytase of coli were similar to those of the ^P^-encoded acid phosphatase. However, 
the authors did not clone the phytase gene to prove that it was identical to APPA gene. We 
have subsequently cloned, overexpressed and characterized APPA gene, and shown that the 

20 E. coli gene APPA codes for a bifunctional enzyme exhibiting both phytase and acid 

phosphatase activities (Golovan et al 2000). Phytases exhibit phosphatase activity, however 
the relative activities differ widely among enzymes (Wodzinski and Ullah 1996). 

Therefore, there is a need for an improved method of allowing access by animals to 
phytase so as to enable efficient phytate metabolism and, thereby reducing phosphate 

25 pollution. 

In the field of protein production using recombinant methods, one of the associated 
problems relates to the lack of required glycosylation. Therefore, a method of producing 
such glycoproteins is also needed. 



30 SUMMARY OF THE INVENTION 

In one embodiment, the invention provides a transgenic non-human animal that 
carries in the genome of its somatic and/or germ cells a nucleic acid sequence including a 
heterologous transgene construct, the constmct including a trangene encoding a protein, the 



BNSDOCID: <WO__0064247A1_L> 



« 



wo 00/64247 PCT/CAOO/00430 

3 

transgene being operably linked to a first regulatory sequence for salivary gland specific 
expression of the protein. 

In another embodiment, the invention provides a transgenic non-human animal that 
carries in the genome of its somatic and/or germ cells a nucleic acid sequence including a 
5 heterologous transgene construct, the construct including a trangene encoding phytase or a 
homologue thereof. 

In yet another embodiment, the invention provides a method of expressing a protein, 
the method comprising the steps of: 

a) introducing a transgene constmct into a non-human animal embryo such that a non- 
10 human transgenic animal that develops fi-om the embryo has a genome that comprises the 
transgene construct, wherein the transgene construct comprises: 

i) a transgene encoding the protein, and 

ii) at least one regulatory sequence for gastrointestinal tract specific expression 
of the protein, 

15 b) transferring the embryo to a foster female; and, 

c) developing the embryo into the transgenic animal 
wherein the transgene is produced in the gastrointestinal tract of the animal. 

In a further embodiment, the invention provides a transgenic animal adapted for 
expressing a protein according to the above method. The invention also provides for the 
20 progeny of such animal. 

In another embodiment, the invention provides a process for producing a protein 
comprising the steps of: 

a) obtaining saUva containing the protein firom a non-human transgenic animal, the 
animal containing within its genome a transgene construct, wherein the transgene construct 
25 comprises: 

i) a transgene encoding the protein, and 

ii) at least one regulatory sequence for salivary gland specific expression of 
the protein, and 

extracting the protein from the saliva. 
30 In a further embodiment, the invention provides a method for expressing a phytase or 

a homologue thereof in a non-human animal, the method comprising: 

a) constructing a nucleic acid sequence including a transgene construct comprising: 
i) a transgene encoding the phytase or a homologue thereof, and 



BNSDCCID: <WQ_0064247A1 I > 



wo 00/64247 



4 



PCT/CAOO/00430 



ii) at least one regulatory sequence for gastrointestinal tract specific expression 

of the protein, and 
b) transfecting the animal with the nucleic acid sequence; 
whereby the animal carries within the genome of its somatic and/or germ cells the transgene 
construct and wherein the animal expresses the phytase or a homologue thereof in its 
gastrointestinal tract. 

In another embodiment the invention provides a nucleic acid molecule comprising a 
nucleic acid sequence including a gene encoding a protein, the gene being operably linked to 
at least one regulatory sequence for gastrointestinal tract specific expression of the protein. 

In another embodiment the invention provides an antibody specific to the protein 
expressed by the above nucleic acid sequence and a test kit for immunologically detecting 
such protein. The invention also provides for hybridomas secreting such antibodies. 

In another embodiment the invention provides cells that are transfected with the above 
nucleic acid sequence. 

In another embodiment, the invention provides a method for producing a protein 
molecule comprising a glycosylated protein secreted in the saliva that exhibits a novel 
physiological activity. One example of such an activity is phytase. 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features of the preferred embodiments of the invention will become 
more apparent in the following detailed description in which reference is made to the 
appended drawings wherein: 

Figure 1 is a schematic diagram representing a method for producing the gene 
construct of the present invention containing the inducible proline-rich protein (PRP) 
promoter/enhancer. More specifically. Figure 1 is a schematic diagram illustrating the steps 
in the construction of the transgenes R15/APPA+intron and R15/APPA used for the 
generation of transgenic mice. 

Figure 2 is a schematic diagram representing a method for producing the gene 
construct of the present invention containing the SV40 promoter. More specifically. Figure 2 
is a schematic diagram illustrating the steps in construction of the plasmid containing the 
transgene SV40/APP A+intron that was introduced by transfection into mammalian cell lines. 

Figure 3 is a schematic diagram representing a method for producing the gene 
.^construct of the present invention containing the constitutive parotid secretory protein (PS?) 
promoter/enhancer. More specifically. Figure 3 is a schematic diagram illustrating the steps 
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in construction of the transgenes Lama2/APP A that codes for the native AppA phytase and 
the Lama2/PSP/APP A that codes for the AppA phytase with the PSP signal peptide sequence. 

Figure 4 is a schematic diagram of the Lama2-APPA plasmid containing the APPA 
transgene. 

5 Figure 5 illustrates the nucleic acid sequence of the Lama2/APPA plasmid containing 

the E, coli APPA gene (SEQ ID NO: 1). 

Figure 6 illustrates the PGR results for transformed mice. More specifically, figure 6 
is a picture of an agarose gel illustrating APPA PGR products firom genomic tail DNA of 
third generation offspring from the transgenic female founder mouse 3-1 generated using the 

10 Xho\ and Noil fragment of the Lama2/APPA construct. A second generation phytase gene 
positive male was crossed with each of two phytase positive transgenic females 9f and 1 If 
(Table 3). From litter 18m x 9f offspring 3, 4, 5 & 6 are PGR positive and from litter 18m x 
I If offspring 2 and 3 are PGR positive. Std is the oligonucleotide standard and the numbers 
on the left are the bp sizes of the standard. Lane C is a negative control reaction mixture that 

15 lacks a DNA template and appA is a positive control containing an amplified segment of the 
ph5(tase gene. The primers used were APP A-UP2 and APPA-KPN. 

Figure 7 illustrates the PGR results for transformed founder pigs. More specifically, 
Figure 7 is a picture of an agarose gel illustrating phytase gene PGR products and p-globin 
PGR products from genomic tail DNA of five founder piglets from litter 167. Std is a 1 kb 

20 ladder. Lane 2 using the phytase primer set is positive for the phytase gene, and all of the 
samples were positive for the (i-globin gene. Lane C is a negative control not containing 
template DNA. The phytase transgene primer set included APPA-UP2 and APPA-KPN gave 
an expected fragment size of 750 bp. The primer set for the p-globin gene included PIG-BGF 
and PIG-BRG gives an expected fragment size of 207 bp. 

25 Figure 8 illustrates the PGR results for transgene rearrangement tests. More 

specifically. Figure 8 is a picture of an agarose gel showing the PGR products of four 
separate primer sets used to amplify different segments of the transgene introduced into pig 
167-02. The Std contained a kilobase DNA ladder. The primers used included lane 1, APPA- 
UP2 and APPA-KPN (750 bp); lane 2, APPA -MATURE and APPA-KPN (1235 bp); lane 3 

30 APPA MATURE and APPA-DOWN2 (608 bp); lane 4, PIG-BGF and PIG-BGR (207 bp), 
lane 5, a negative control without DNA template added; lane 6, the appA gene & primers 
APPA-UP2 and APPA-KPN. The numbers on the left indicate the sizes of the bands in the 
standard. No PGR products were detected in the absence of either DNA template or primers. 
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Figure 9 illustrates weight and salivary phytase activity of the transgenic boar 167-02 
and average weight of the pen-mates at intervals during growth. Symbols: Weight of 167-02, 

Average weight ± SD of four penmates. A; phj^ase activity of 167-02, ■; Phytase 
specific activity, □. Arrows indicate sampling for fecal phosphorus concentration. 
5 Figure 10 illustrates weight and salivary phytase activity of the transgenic boar 282- 

02 and average weight of the pen-mates at intervals during growth. Symbols: Weight of 282- 
02, ; Average weight ± SD of five penmates, A; phytase activity of 282-02, ■; Phytase 
specific activity, □. Arrows indicate sampling for fecal phosphorus concentration. 

Figure 1 1 illustrates weight and salivary phytase activity of the transgenic boar 282- 
10 04 and average weight of the pen-mates at intervals during growth. Symbols: Weight of 282- 
04, Average weight ± SD of five penmates. A; phytase activity of 282-04, ■; Phytase 
specific activity, Arrows indicate sampling for fecal phosphoms concentration. 

Figure 12 illustrates weight and salivary phytase activity of the transgenic boar 405- 
02 and average weight of the pen-mates at intervals during growth. Symbols: Weight of 405- 
15 02, Average weight ± SD of four penmates. A; phytase activity of 405-02, ■; Phytase 
specific activity, □ . Arrows indicate sampling for fecal phosphorus concentration. 

Figure 13 illustrates weight and salivary phytase activity of the transgenic boar 421- 
06 and average weight of the pen-mates at intervals during growth. Symbols: Weight of 421- 
06, •; Average weight ± SD of four penmates. A; phytase activity of 421-06, ■; Phytase 
20 specific activity, Arrows indicate sampling for fecal phosphoms concentration. 

Figure 14 illustrates the PGR results of first generation pigs. More specifically. 
Figure 14 is a picture of an agarose gel showing the PGR analysis of eight liter 154 piglets. 
The phytase transgenic boar 167-02 was used to breed a non-transgenic female. Std, 100 bp 
ladder, numbers on left are the sizes of the firagments in each band in bp; lane 167-02, DNA 
25 fi-om boar 167-02 1, DNA from 167-02; lane Q is a lane without added DNA; lanes 1-8, are 
amplified DNA inserts from each of the offspring piglets of the litter. Phytase primers were 
Lama-UP and APPA-DOWN4. (5-globin primers were PIG-BGF and PIG-BGR. 

Figure 15 illustrates a sodium dodecylsulfate gel stained with silver demonstrating the 
sizes of the E, coli produced APPA phytase and the APPA phytase produced by the pig and a 
30 demonstration that the pig phytase is glycosylated. More specifically. Figure 15 is a picture 
of a sodium dodecylsulfate polyacrylamide gel electrophoresis (SDS-PAGE) profile of the 
purified AppA phytase produced in E. coli and the purified pig salivary phytase stained 
directly with silver (A) and a transfer firom a similar SDS-PAGE gel transferred to 
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nitrocellulose and stained for glyoproteins (B). Creatinase is not glycosylated while 
transferring is glycosylated. The numbers on the left are the masses in of the molecular mass 
standards (Std) expressed in kDa. 

Figure 15B is a picture of Western blot of the untreated pig AppA phytase and the 
5 same phytase after treatment with a combination of three deglycosylating enzymes- Lane 1, 
Purified AppA phytase produced in E, coli (untreated); lane 2, purified pig phytase 
(untreated); lane 3, purified pig phytase treated with the combination of deglycosylating 
enzymes including N-glycosidase F, O-glycosidase and neuraminidase. 

Figure 16 illustrates a Western blot of the pig phytase and the £. coli produced APPA 

10 phytase using monoclonal antibodies directed to the APPA phytase documenting that they 
have homologous epitopes. More specifically, Figure 6 is a Western blot of the AppA 
phytase fi-om pig saliva after various purification steps and of purified phytase produced in E, 
colL A monoclonal antibody prepared against the E. coli phytase was used as the primary 
antibody for detection. Lane 1, saliva firom non-transgenic pig 164-04; lane 2, saliva firom 

15 transgenic pig 167-02; Lane 3, saliva fi-action not bound to DEAE-Sepharose; lane 4, 
salivary phytase bound to DEAE-Sepharose and released with an NaCl gradient; lane 5, 
salivary phytase fiulher purified by Chromatofocusing with a pH gradient of 4 to 7; lane 6, 
phytase purified from E. coli. The numbers on the left are the masses of molecular mass 
standards (not shown) expressed in kDa. 

20 Figure 1 7 illustrates an SDS-Page of the E. coli APPA phytase, saliva samples firom 

phytase negative and positive pigs and mice and a corresponding Western blot documenting 
that phytases firom all three sources have homologous antigenic epitopes, but the animal 
phytases are larger than that produced in E. coli. More specifically. Figure 6 is a SDS-PAGE 
profile of the purified E. coli produced AppA phytase and the AppA phytases produced by 

25 pigs and mice stained with silver (A) and a Western blot of an identical set of protein samples 
(B), A polyclonal antibody prepared against the E, coli phj^tase was used as the primary 
antibody for detection. Lane 1, Purified AppA phytase produced in E. coli; lane 2, Saliva 
fi-om a non-transgenic pig 164-01; lane 3, Saliva from a AppA producing transgenic pig 167- 
02; lane 4, Purified phytase from pig 167-02; lane 5, Saliva from a non-transgenic mouse; 

30 lane 6, Saliva from a transgenic mouse containing R15/APPA transgene induced with 

isoproterenol; lane 7, Saliva from a transgenic mouse containing the Lama/APPA transgene; 
Std, Molecular mass markers. The numbers on the left are the masses of molecular mass 
standards (not shown) expressed in kDa. 
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Figure 18 illustrates the nucleic acid sequence of the known segment of the 
R15/APPA + intron plasmid including the vector sequences of pBLCATS (SEQ ID NO:2). 

Figure 19 illustrates the nucleic acid sequence of the known segment of the 
RI5/APPA + intron transgene construct used for the generation of transgenic mice (SEQ ID 
5 NO:3). 

Figure 20 illustrates the nucleic acid sequence of the known segment of the 
R15/APPA plasmid including the vector sequences of pBLCAT3 (SEQ ID NO:4). 

Figure 21 illustrates the nucleic acid sequence of the known segment of the 
R15/APPA transgene constmct used for the generation of transgenic mice (SEQ ID NO:5). 
10 Figure 22 illustrates the nucleic acid sequence of the SV40/APPA + intron plasmid 

(SEQ ID NO:6). 

Figure 23 illustrates the nucleic acid sequence of the Lama2/APP A transgene 
construct used for the generation of transgenic mice and transgenic pigs (SEQ ID NO: 7). 

15 DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In the following description, a number of recombinant DNA technology terais are 
used. The following definitions have been provided in order to enable a clearer 
imderstanding of the specification and appended claims: 

"Promoter" - a DNA sequence generally described as the 5' region of a gene and 

20 located proximal to the start codon. The transcription of an adjacent gene is initiated at the 
promoter region. If a promoter is an inducible promoter then the rate of transcription 
increases in response to an inducing agent. A constitutive promoter is one that initiates 
transcription of an adjacent gene without additional regulation. 

"Operably Linked" - a nucleic acid sequence is "operably linked" when placed into a 

25 functional relationship with another nucleic acid sequence. For instance, a promoter or 

enhancer is "operably linked" to a coding sequence if the promoter causes the transcription of 
the sequence. Generally, operably linked means that the linked nucleic acid sequences are 
contiguous and, where it is necessary to join two protein coding regions, contiguous and in 
one reading frame. 

30 "Phytase'' - any protein that liberates phosphate from myo-inositolhexakis-phosphate 

or other inositol phosphates. Its catalytic capability may be limited to phytic acid or one of 
its salts, or it may show less specificity and hydrolyze a variety of phosphorylated 
corapo.ynds. 
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"Gene" - a DNA sequence that contains a template for an RNA polymerase ana 
contains infomiation needed for expressing a polypeptide or protein. 

"Polynucleotide Molecule" - a polydeoxyribonucleic (DNA) acid molecule or a 
polyribonucleic acid (RNA) molecule. 
5 "Expression" - the process by which a polypeptide is produced from a structural gene. 

"Cloning vehicle" - is a plasmid or phage DNA or other DNA sequence which is 
capable of carrying genetic information into a host cell. A cloning vehicle is often 
characterized by one or more endonuclease recognition sites at which such DNA sequences 
may be cut in a determinable fashion without loss of an essential biological function of the 
10 vehicle- A cloning vehicle is a DNA sequence into which a desired DNA may be spliced in 
order to bring about its cloning into the host celL 

"Vector" - is a term also used to refer to a cloning vehicle. 

"Plasmid" - is a cloning vehicle generally comprising a circular DNA molecule that is 
maintained and replicates autonomously in at least one host cell. 
15 "Expression vehicle" - a vehicle or vector similar to a cloning vehicle but which 

supports expression of a gene that has been cloned into it, after transfomiation of a host. The 
cloned gene is usually placed under the control of (i.e. is operably linked to) certain control 
sequences such as promoter sequences. 

"Host'' - a cell that is utilized as the recipient and carrier of recombinant material. 
20 "Homologous*' - refers to a nucleic acid molecule that originates from the same genus 

or species as the host. 

"Heterologous" - refers to a nucleic acid molecule that originates from a different 
genus or species than that of the host. 

"Glycoprotein" - refers to a peptide molecule that has undergone glycosylation. 
25 "Glycosylation" - refers to the addition of carbohydrate groups to a amino acid 

residues of a peptide molecule. 

In recent years, transgenic animals have been developed for many purposes (Pinkert 
et al, 1990) (Wall et al 1997). One premise, therefore, for the present invention is that by 
providing a transgenic animal capable of expressing phytase, the problems discussed above 
30 would be obviated. The options for heterologous phytase expression in animals include (i) 
salivary gland secretion of a phytase, (ii) pancreatic secretion of the enzyme into the small 
intestine along with the digestive enzymes, or (iii) secretion from the intestinal epithelial cells 
much like that of indigenous alkaline phosphatase and glycosidases (Low, 1989). The£. coli 
phytase would appear to be best suited for hydrolytic activity in the monogastric stomach 
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because the enzyme has a pH optimum in the range of 2.5 to 4.5 and it is resistant to pepsin, the 
predominant protease active in the stomach. The phytase has a periplasmic location in E. coli 
and has an N-terminal signal peptide sequence (Golovan et al., 1999) that seemed optimally 
adapted for secretion from the parotid gland. Phytase could be expressed in either the pancreas 
5 for secretion into the small intestine or it could be expressed in the intestinal epithelial tissue 
and secreted into the intestinal milieu. However, if these choices of expression locations 
were chosen, it would be necessary to select an enzyme active at the more neutral pH of the 
small intestine and one which was more resistant to pancreatic enzymes including trypsin, 
chymotrypsin and elastase. 

10 Factors of importance in terms of the expressed enzyme when selecting a phytase for 

expression in the gastrointestinal tract include a pH that is optimum for activity, high 
catalytic activity, broad substrate specificity, and protease resistance. If any of these 
properties, or indeed others, is not acceptable, there are now sophisticated molecular methods 
for modifying the properties of an enzyme. These include site directed mutagenesis, random 

15 mutagenesis and various modifications of DNA shuffling (Harayama, 1998; Crameri et al., 
1998). 

Synthesis of phytase in the salivary gland and secretion in the saliva would, therefore, 
provide for early contact of the enzyme with phytic acid present in the feed and provide 
sufficient time for hydrolysis. 

20 The salivary gland system of the pig consists of three pairs of glands, the parotid gland, 

which secretes through a duct on each cheek, and mandibular and submaxillary glands that have 
joint ducts that secrete beneath the front on the tongue. Saliva secreted in the pig via these ducts 
is discontinuous and is produced during consumption of solid foods, and can equal the weight of 
food consumed when water is limited during feed consumption (Corring, 1980; Arkhipovets, 

25 1 956). For example, the quantity of saliva produced by a 45 kg pig can vary from near zero 

when the pig receives a mainly liquid diet to 500 g when a dry diet is consumed without access 
to water. The salivary glands of the pig secrete amylase (Rozhkov and Galimov, 1990) and a 
variety of other salivary proteins and mucopolysaccharides. 

To our knowledge no porcine genes coding for salivary proteins have been cloned. 

30 However, genes coding for major proteins secreted by the rat and mouse have been cloned and 
characterized. A multigene family encoding a group of unique proteins high in proline, the so- 
called proline-rich proteins (PRPs) are produced when either mice or rats consume tannins or are 
injected with isoproterenol. 
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It would be advantageous to develop an animal that is transformed to express phytase, 
preferably in the salivary gland. In such case, the phytate naturally occurring in the animal 
feed can be utilized by the animal without any additives being used. This will decrease the 
cost of animal production, and furthermore, will avoid polluting the environment with 
5 phosphoms. Therefore, the present invention aims to overcome the deficiencies of the prior 
art relating to increasing phytate utilization and, particularly, to provide transgenic animals 
which express phytase. 

In the production of heterologous proteins by means of recombinant methods, several 
hurdles have been faced. One such htirdle that is often faced is the lack of required post- 
10 translational modification of the expressed protein thereby resulting in a protein that is 

structurally and/or functionally, different firom the desired molecule. Glycosylation is one 
such post- translational modification that is desired. However, such modification is generally 
found to occur in more complex mammalian systems. Therefore in one embodiment of the 
present invention there is provided a method of producing recombinant glycoproteins. 
15 In one embodiment, the present invention provides an animal capable of inducible or 

constitutive salivary expression of a heterologous protein. To illustrate this, the mouse was 
chosen as the animal model and the gene constructs used for transformation were created 
using the rat proline-rich protein (PRP) promoter/enhancer (inducible promoter) and the 
mouse parotid secretory protein (PSP) promoter/enhancer (constitutive promoter). In this 
20 illustration, phytase was used for expression in saliva. 

After finding that an inducible phytase could be expressed in the parotid gland 
of mice the expression of the phytase transgene under the control of the constitutive PSP 
promoter was then tested. Two mice transgenic for the PSP constmct were produced under 
contract at the University of Alabama. 
25 Following the testing of the mice described above, transgenic pigs were developed by 

introduction into the genome a phytase transgene consisting of a constitutive promoter 
driving the synthesis of a highly active phytase. The pigs so generated were found to excrete 
less phosphorus in their feces than non-transgenic pigs. 



30 Expression in the Salivar\^ Glands 

Saliva is a clear colorless fluid secreted by major salivary glands (parotid, 
submandibular, sublingual and minor salivary) that lubricates and cleans the oral structure, as 
well as initiates the process of digestion. The parotid glands are two of six major glands 
associated with the production of saliva. The parotid gland is composed mainly of two cell 
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types: acinar and interglobular duct cells. The acinar cells, which represent 75 to 85?oof the 
tissue, are the sites of secretory protein synthesis (Frandson and Spurgeon 1992). Two very 
abundant proteins are produced by these cells: a-amylase (AMY-1) (2% of polyA RNA) 
(Madsen and Hjorth 1985), and parotid secretory protein (PSP) (10% of polyA RNA) (Shaw 
5 and Schibler 1986). Several constructs are now available which allow tissue-specific 
expression of a transgene in the salivary glands of mice. 

The salivary secretion in pigs has not received the attention given to that of mice and 
humans. It was suggested that salivary secretion is discontinuous (less secreted between 
periods of meal consumption). Up to 500 g of saliva may be secreted by a 45 kg pig upon 

10 consumption of 500 g of dry feed (Coning 1980). Wide variations were detected in both the 
flow rate and electrolytes in saliva between animals and even between samples taken from 
the same animal on separate days (Tryon and Bibby 1966). Very little is known about the 
composition of pig's saliva or salivary enzymes. Salivary amylase was detected, although the 
quantity was 250 000 times less than that of pancreatic amylase, and 100 times less than in 

15 human saliva (Low 1989). There are no constructs known which would allow salivary gland- 
specific expression of transgene in pigs. 

I) APPA Gene Under Control Of An Inducible Promoter 

20 1) Construction of R15/APPA constructs (Inducible Promoter) 

In this process, a plasmid is constmcted by linking a promoter/enhancer for a saliva 
protein with the APPA gene, which codes for the bifunctional phytase, acid phosphatase. The 
APPA gene used in this construction was cloned from E, coli ATCC 33965 into pBR322. 
This is described above (Golovan et al., 2000). 

25 Proteins, unusually high in proline, the so-called proline-rich proteins (PRPs), 

comprise about 70% of the total proteins in human saliva (Bennick 1982). Unlike the 
constitutive expression of the PRPs in humans, the salivary glands of mice, rats and hamster 
normally either do not express PRPs or express them in low levels. In the rat and mouse, 
PRP gene expression can be dramatically induced by diets high in tannins or by injection 

30 with the P-agonist isoproterenol (Carlson 1993). After 6 to 10 days of daily isoproterenol 

injection the PRPs comprised about 70% of the total soluble protein in parotid gland extracts. 
PRP cDNA and PRP genes have been cloned and characterized from rats (Clements et al 
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1985), mice (Ann and Carlson 1985), hamsters (Mehansho et aL 1987), and humans (Kim 
andMaeda 1986). 

Transgenic mice were used to locate the cis-acting DNA elements that are essential 
for salivary-specific and inducible expression of the rat proline-rich protein gene, R15. It was 
5 found that a parotid control region (-6 to -1 .7 kb) upstream of the R15 promoter is capable of 
directing parotid-specific and isoproterenol-inducible expression of a heterologous promoter 
construct (Tu et aL 1993). The distal -10 to -6 kb region was shown to function as an 
enhancer, which can increase levels of expression more than 30-fold. The -6 to -1.7 kb 
region also seems to function as a locus control region (LCR), because it conferred copy 
10 number-dependent and chromosomal position-independent expression of a reporter gene in 
15 out of 15 independent transgenic mice (Tu, Lazowski, Ehlenfeldt, Wu, Lin, Kousvelari, 
and Ann 1993). 

We obtained the R15-PRP promoter from Dr. DX. Aim as a plasmid AORIS/ CAT, 
which placed the chloramphenicol acetyltransferase gene (CAT) under control of the 

15 inducible R15-PRP promoter- We decided to use the plasmid as a basis for transgene 
constmction (Figure 1). Due to the absence of complete sequence information about the 
R15-PRP promoter (only 2 kbp out of 10 kbp was sequenced) we removed the R15-PRP 
promoter by Xho I digestion (Figure 1, step 1). Re-ligated plasmid was used as a template 
for PGR with CAT-ATG and CAT-TAA synthetic primers. The 4.3 kbp CATpcr fragment 

20 had the initiation site of the CAT gene substituted with the optimal eukaryotic initiation 

sequence (Kozak 1987). The fragment was purified by agarose gel electrophoresis, re-ligated 
to itself and used to transform E. coli (Figure 1, step 2). The CATpcr plasmid was digested 
with Nco I and fiUed-in using T4 DNA polymerase to generate a blunt end After that, the 
CATpcR fragment was digested with Eco47in and purified by agarose gel electrophoresis 

25 (Figure 1, step 3). Three rare codons in the APPA gene were modified during the sub-cloning 
steps leading to the constmction of the transgene. Specifically, the Alas coding sequence was 
changed from GCG to GCC, the Pro428 sequence was changed from CCG to CCC, and the 
Ala429 sequence was changed from GCG to GCT. This modification was made in order to 
increase the possibility of transcription of the gene in eukaryotic cells. The APPA gene was 

30 amplified by PCR using the previously cloned APPA gene from the pBKMHAPPA plasmid 
with the synthetic primers APPA-DRA and APPA-SMA. The 1.3 kbp APPApcr fragment 
generated by PCR was digested with Dra I and Sma I and gel-purified (Figure 1, step 4). 
APPApcaand CATpcr fragments were blunt end ligated to produce CAT/APPA+intron 
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vector (Figure 1, step 5), which was introduced into a DH5a strain ofE. coli. The insert 
orientation was checked by restriction digest with Sal I and EcoR L The transgene in 
CAT/APPA+intron was checked by sequencing both strands. To remove the SV40 small t 
intron the 2.3 kbp -4PP-/4/intron/polyA fragment was excised fix)m a plasmid by Xho I and 
5 EcoR I digestion (Figure 1, step 6a), gel purified and digested by Dra I (Figure 1, step 6b). 
The 1.5 kbp (APPA) and 0.2 kbp (polyA) fragments were gel-purified and linked together in 
three way ligation with CATpcr digested with Xho I and EcoR I (Figure 1, step 6c). The 
resulting plasmids CAT/APPA and CAT/APPA+intron were digested with Xho I, gel- 
purified and re-ligated with R15-PRP promoter digested with Xho I (Figure 1, step 7). 

10 Because of the low efficiency of ligation the whole ligation mixture was used to transform 
E.colU total plasmid DNA was prepared and run on the agarose gel. Plasmids which were 
larger than the original CAT/APPA (5.6 kbp) were eluted and re-transformed inE.colu 
Plasmids with the R15-PRP insert (15 kbp) were identified by electrophoresing DNA fi-om a 
single colony on an agarose gel. The correct orientation was identified by PCR with R15- 

15 UP 1 and APPA-DO WN2 synthetic primers. The plasmids Rl 5/APP A and Rl 5/APP A+intron 
were both digested with Hind III and Kpn I; transgenes were gel-purified and fiirther purified 
using a Qiagen column (Figure 1, step 8). 

Figure 18 illustrates the nucleic acid sequence for the plasmid containing the known 
segment of the Rl 5/APP A + intron sequence including the vector sequences of pBLCAT3. 

20 The sequence of this plasmid is designated as SEQ ID NO:2. 

Figure 19 illustrates the nucleic acid sequence for the transgene constmct containing 
the known segment of the Rl 5/APP A + intron sequence used for the generation of transgenic 
mice. The sequence of this transgene is designated as SEQ ID NO:3. 

Figure 20 illustrates the nucleic acid sequence for the plasmid containing the known 

25 segment of the Rl 5/APP A sequence including the vector sequences of pBLCAT3. The 
sequence for this plasmid is designated as SEQ ID NO:4. 

The pBLCAT3 sequence indicated above is present in the CAT/APPA of Figure 1 and 
in the CAT/APPA+intron of Figure 2. This sequence was part of the original -10R15/CAT 
and a portion of it was carried through in the constmction process. 

30 Figure 21 illustrates the nucleic acid sequence for the transgene construct containing 

the known segment of the Rl 5/APP A sequence used for the generation of transgenic mice. 
The sequence of this transgene is designated as SEQ ID N0:5. 
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2) Expression of SV40/APPA+intron in Cell Culture 

To produce an SV40/APPA plasmid for expression of APPA in cell culture, the SV40 
promoter/enhancer was amplified by PGR from the pSV-p-galactosidase plasmid (Promega) 
using the synthetic primers SV-HEND and SV-XHO. The SV40 promoter/enhancer fragment 
5 was digested with Xho I and Hind m, gel purified, and ligated into CAT/APPA digested with 
Xho I and Hind III (Figure 2). 

Figure 22 illustrates nucleic acid sequence for the S V40/APPA + intron. The 
sequence for this plasmid is designated as SEQ ID NO:6. 

We obtained a rat parotid acinar cell line PARC 5.8 (Quissell et aL 1998) that we 

10 intended to use for transient expression of the phytase transgene. The purpose was to test the 
efficiency of different constructs for transgene expression and also to detect any deleterious 
effects of phytase expression before introduction into the animals. We tried transient 
expression of the APPA gene using R15/APPA and R15/APPA+intron constructs but because 
of low transfection efficiency and/or low expression levels, we were unable to detect either 

15 phytase or P-galactosidase that we used as a control for transfection. 

We exchanged the R15-PRP inducible promoter from the R15/APPA constmct with 
the SV40 constitutive promoter-enhancer, which enables high level transient expression in 
different cell cultures. CHO, COS 7 and HELA cell lines were screened for transient 
expression of the APPA phytase using the S V40 promoter/enhancer. AU cell lines were 

20 maintained on DMEM/F12 (Sigma) cell medium with 10 % (wt/vol) heat-inactivated fetal 
bovine serum at 3TC in 5% CO2 and 95% air. Cells were grown to 70 % confluence before 
transfection. Two hours before transfection the medium was exchanged with fresh medium. 
Cells were transformed with 5 }ig of DNA per 60 nun culture plate (1:1 SWAQIAPPA and 
SV40/p-galactosidase) using the DNA-Calcium-Phosphate method of transfection (Gorman 

25 et aL 1983). After 6 hours of incubation the medium was removed and cells were subjected to 
glycerol shock for 3 min (Ausbel et al. 1992). Cells were washed with phosphate-buffered 
sahne (PBS) and incubated in fresh medium under standard growth conditions. After 48 
hours of incubation cell-free culture fluid was collected, the cells washed two times with PBS 
and lysed with I ml of 1% (vol/vol) NP-40, ImM disodium EDTA in Hanks balanced salts 

30 (HBSS) for 1 hour at 4^C. The ph>tase assay was performed in a final volume of 100 jil of 
0.1 M sodium acetate/acetic acid buffer (pH 4.5) using sodium phytate (4 mM) as a substrate 
at 37°C. After 6 hours of incubation the reaction was stopped with 67 [il ammonium 
molybdate/ammoniunrvanadate/nitric acid mixture and the concentration of liberated 
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inorganic phosphate determined at 405 ran (Engelen et al 1994). One unit (U) of enzyme 
activity was the amount of the enzyme releasing 1 lomol inorganic phosphate per minute. The 
assay was performed in triplicate. As a control for endogenous phytase activity, non- 
transfected cell lines were used. 
5 We did not detect endogenous phytase activity in non-transfected cell lines. Phytase 

activity was detected in all transfected cell lines, with COS 7 cells expressing a total of 0.35 U 
of phytase in cell-free culture fluid (4 ml) and 0.0034 U in the cell fraction (1.1 ml) obtained 
from the same plate. The phytase activity produced by COS7 cells was 7 times higher than 
that of CHO and 35 times more than the HELA cell line. More than 99% of activity was 
1 0 located in cell-free culture fluid, which suggests that the expressed enzyme was exported out 
of the cell using the bacterial signal sequence. We were imable to detect expression of 
cytoplasmic P-galactosidase, which we wanted to use as a control for transfection efficiency. 



3) Expression of R15-PRP/APPA in Transgenic Mice 

15 Transgenic mice were generated using the constructs R15/APPA and 

R15/APPA+intron by Dr. C.A. Pinkert at the NICHD Transgenic Mouse Development 
Facility (NTMDF), University of Alabama at Birmingham, Alabama. The procedures 
followed in generating the mice have been standardized by the NTMDF and further 
information concerning this can be obtained at: http://transgenics.bhs.uab.edu/pagel.htm, the 

20 content of which is incorporated herein by reference. This procedure involved the 

microinjection technique for transfecting mice with the desired nucleic acid sequence. To 
summarize, the sequences are microinjected into mouse zygotes and the surviving eggs are 
implanted into pseudopregnant recipient mice. The recipient mice then give birth to the 
resulting founder transgenic mice. It will be appreciated that various other methods of 

25 generating transgenic mice may be used in the present invention. 

The R15/APPA transgene in mice was detected by PCR using the primers CAT-UP 1 
and APPA-DOWN2 that gives rise to a 700 bp fragment using the standard PCR conditions, 
except that the hybridization step was set at 51^C for 40 seconds and the polymerization step 
was at 72°C for one minute. 

30 For the R15/APPA construct 8 PCR positive founder mice were obtained of which 4 

were males and 4 were females. Three of the founders did not pass the transgene to progeny 
and were probably mosaics. For R15/APPA+intron 5 PCR positive founder mice were 
obtained, 3 were males and 2 were females, and one of them was found to be mosaic. At 10 
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to 12 weeks of age PRP production in the PCR positive progeny from different lines was 
induced for 1 0 days by daily intraperitoneal (ip) injection of Img isoproterenol dissolved in 
1 00 l^^ sterile saline. To serve as a control several PCR negative progeny were also induced. 
No significant differences in weight were noticed between PCR positive and PCR negative 
5 progeny at either the beginning or end of the induction period. Saliva was collected before 
induction and at the end of the 10 day induction period. 

To collect saliva, mice were lightly anesthetized with a ketamine/xylazine mixture (ip 
injection of 50 mg ketamine and 5 mg xylazine per kg body weight diluted in water) and 
saliva flow was induced by injection with pilocarpine/isoproterenol (ip injection of 0.5 mg 

10 pilocarpine and 2 mg isoproterenol per kg body weight dissolved in saline) (Hu et al 1992). 
Between 100-250 p.1 of saliva was collected from each mouse over a 30 min period begiiming 
5 min after the pilocarpine/isoproterenol injection. 

The saliva was collected from each mouse by holding it in one hand and withdrawing 
saliva from the comer of the mouth with a 20 }j.1 pipetter. Collected saliva was transferred to 

15 a cold Eppendorf microcentrifuge tube containing 2 p.1 of 0.5 M EDTA (pH 8.0) and 4 \i\ of 
10 mg/ml protease inhibitor Pefabloc (Boehringer Mannheim) dissolved in water. The tubes 
with saliva were kept on ice until assays were conducted, Phytase activity in the saliva was 
assayed as described for the S V40/APPA expressed in cell culture. 

Phytase expression was not detected in either un-induced or in induced PCR negative 

20 mice. For PCR positive mice, phytase expression was not detected in those that were un- 
induced. However, phytase expression was observed for PCR positive mice that were 
induced. The results of this study are sununarized in Table 1. 

Even though it was possible to distinguish saliva from induced PCR positive from that 
of PCR negative mice in a phytase assay by a characteristic yellow color, saliva from some of 

25 the negative mice, when assayed, produced cloudiness that was impossible to remove by 
centrifiigation and that affected spectrophotometer readings. We did not notice any gender 
differences in expression, both males and females were found to produce phytase in saliva. 
In three Unes (all RlS/APPA+intron) no phytase expression or very low level of expression 
(0-03-0.95 U/ml) was detected, in 4 lines the level of expression ranged from 7 to 87 U/ml, 

30 and two lines (both R15/APPA) produced very high levels of phytase in saliva, 252 and 547 
U/ml. 

These experiments demonstrated that ph3^ase can be expressed at a very high level in 
the salivary glands of mice^without detrimental effects on the animals. We also were able to 
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produce progeny with an inducible salivary phytase from animals expressing the inducible 
phytase thereby documenting inheritance of the trait, and showing that the reproductive 
capability of animals was not affected. When the F2 generation of mice were tested for 
salivary phytase the level of phytase production was preserved. 
5 Founders containing the transgene without the intron gave offspring that produced 

significantly higher levels of phytase. The SV40 intron in the R15/APPA+intron construct 
seems to cause a lower level of expression, and in three lines (Alf, A20f and BOm) the level 
of phytase was barely detectable. The level of phytase expression in A2m line 
(R15/APPAH-intron) was 6.2 times lower than that of the BOm-intron line (R15/APPA). 
1 0 Preliminary experiments showed that when the enzyme was analyzed by PAGE its 

size was increased from 42 kDa to 60 kDa, It is likely modified by glycosylation, but stable 
and active. 



ID APPA Gene Under Control Of A Constitutive Promoter 

15 

1) Construction of the Lama2/APPA Transgene (Constitutive Promoter) 

The murine parotid secretory protein (PSP) is the most abundantly expressed protein 
in the parotid gland of mice (Madsen and Hjorth 1985). After an hour of pulse labeling, the 
mouse parotid gland incorporates 65 to 85% of ^'^C-leucine into this single protein (Owerbach 
20 and Hjorth 1980). It was estimated that PSP mRNA accumulates up to 50,000 molecules per 
cell and that from 3 to 5 molecules of PSP are produced for every molecule of amylase 
(Madsen and Hjorth 1985). Despite the predominance of the PSP in saliva its fimction is not 
well characterized. 

The single-copy gene coding for PSP has been cloned and characterized. It has two 
25 alleles PSP^ (Shaw and Schibler 1986) and PSP^ (Owerbach and Hjorth 1980). ThePSP^ 
allele is also expressed in the sublingual gland, but at 1/10 of the level found in the parotid 
gland. It was shown that 4.6 kbp of 5' flanking sequence of PSP*^ is sufficient for salivary 
gland specific expression. The level of sublingual expression approached 100% of the PSP 
mRNA level, whereas the parotid expression did not exceed 1% (Mikkelsen et aL 1992), 
30 which demonstrates that regulatory sequences for sublingual and parotid expression are not 
identical. The level of expression was also dependent on the site of integration. The same 
constmct was used for expression of the C-tenninal chain of the human blood coagulation 
factor VIU, FVIU. A high level of FVUl mRNA was detected in the sublingual gland and a 
low level in the parotid gland. The transgenic lines also secreted the FVlll light chain into 
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saliva at the level of about 10 units per salivation (about 0. 05 ml of saliva) (Mikkelsen et 
aL,1992). Later the same group achieved a high level of parotid- specific expression that was 
similar or even exceeded that of the endogenous gene by using 1 1.4 kbp of 5' flanking 
sequences and 2.5 kbp of 3* flanking sequences (Larsen etal. 1994). The expression also 
5 seems to be position-independent and copy-number-dependent that could indicate the 
presence of a LCR in these sequences. 

Lama 2 is a portion of the PSP gene and comprises an 18 kbp construct that is 
expressed in transgenic mice at up to 56% of the endogenous PSP gene. 

Because a large part of Lama 2 had not been sequenced, the construct was first 
10 disassembled and subcloned into pBluescript KS(+) and after incorporation of ih^APPA 
gene, the Lama 2 was reassembled back (Figure 3). We used unique enzymes RsrII and 
Smal to remove a 3.4 kbp fragment from Lama2, which was subcloned into the multiple 
cloning site (MCS) of pBluescript II KS(+) that was previously digested with Kpnl and 
Smal, using a Kpnl -RsrII adapter (Figure 3, step 1). 
15 Kpnl* Rsrn 

TGGGAGGTCG 
CATGACCCTCCAGCCAG 

That allowed us to preserve the RsrII (CG/GWCCG) site and destroy the Kpnl site 
(GGTAC/C> GGTAC/T), which would otherwise interfere with future cloning. The 
20 pKS/Lama construct was digested with Apal and Kpnl and used in a three-way ligation with 
the modified APPA (Figure 3, step 2). We designed two PSP/APPA constructs. One 
construct APPA-signal/APPA (Figure 3, steps 3a-7a) had the original bacterial signal 
sequence from the APPA protein having the following amino acid sequence: 

25 Met-Lys-Ala-Ile-Leu-Ile-Pro-Phe-Leu-Ser-Leu-Leu-Ile-Pro-Leu-Thr-Pro-Gln-Ser-Ala-Phe- 
Ala 

We also modified a sequence near the ATG codon to resemble the optimal 
mammalian Kozak sequence (GCC GCC A/GCC ATG G) (Kozak 1987), but we did not 
30 mutagenize the +4 position because it would change Lys to Glu in the signal sequence with 
possible deleterious consequences for protein export. This optimized sequence was used in 
our previous constmct R15/APPA and led to high levels of phytase production. We checked 
the APPA bacterial signal sequ^ce using the PSORT computer neural network trained on 
eukaryotic signal sequences and further described at http://psort.nibb. ac.jp:8800/ (Nakai and 
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Kanehisa 1992). The APPA bacterial signal sequence was recognized as an efficient leader 
peptide and the cleavage site was correctly predicted. PSORT also predicted that there is a 
high probability that phytase would be exported correctly outside of the cell. There were also 
publications showing that some bacterial signal sequences might function efficiently in 
5 mammalian cells (Williamson et aL 1994) (Hall et al 1990). Our experiments using cell 
culture demonstrated that the APPA signal was correctly processed with export of phytase 
outside of the cell. 

Experiments using cell culture cannot predict the direction of export and if phytase 
were exported into blood vessels instead of salivary ducts that could lead to deleterious 
10 effects. That is why we also designed a second constmct PSP-signal/APPA (Figure 3, steps 
3b-7b) that would preserve the original PSP signal amino acid sequence: 

Met-Phe-Gln-Leu-Gly-Ser-Leu-Val-Val-Leu-Cys-Gly-Leu-Leu-Ile-Gly-Asn-Ser-Glu-Ser 

1 5 This leader peptide was also efficiently recognized by PSORT with the correct 

cleavage site (Nakai and Kanehisa 1 992). In this construct we also preserved the original 
PSP sequences near the ATG start codons, which may not be optimal, but could be important 
in regulation of gene expression. The APPA gene for both constructs was ampUfied by PGR 
using as the template our previous transgenic construct R15/APPA that possessed the optimal 

20 Kozak sequence and the modified codons for residues Ala3, Pro428 and Ala429 as described 
earlier. For the APPA signal/APPA constmct two synthetic primers were used which 
introduced a Clal site near the ATG codon (APPA-CLA) and a Kpnl site near the TAA stop 
codon (APPA-KPN). The APPApcrI product was digested with Clal and Kpnl. The Clal 
site was also introduced into Lama 2 using pKS/Lama 2 as template for PGR. LAMA-UP 

25 primer was located upstream of Apal site and the LAMA-CLA primer introduced the Clal 
site near ATG codon (Figure 3, step 3a). LamapcRl product was digested with Clal and 
Apal (Figure 3, step 4a). pKS/Lama (Apal -Kpnl), LamapcRl (Apal- Clal) and APPApcrI 
(Clal -Kpnl) were combined together in a three-way ligation reaction (Figure 3, step 5a). 
The recovered pKS/Lama/APPA plasmid was digested with RsrII, Smal and inserted back 

30 into Lama2 (Figure 3, step 6a). 

For the PSPsignal/ APPA construct, the synthetic APPA -KPN primer was used with 
the synthetic APPA -MATURE primer, which produced phytase without a signal sequence. 
The APP Apcr2 product was blunt-ended using T4 DNA polymerase and digested with Kpnl . 
The PSP signal sequence was produced using the LAMA-UP and LAMA -SIGNAL primer 
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(Figure 3, step 3b). The LamapcR2 was blunt-ended using T4 DNA polymerase and digested 
with Apal (Figure 3, step 4b). pKS/Lama (Apal-Kpnl), LamapcR2 (Apal-blunt) and 
APPApcr2 (blunt-Kpnl) were combined together in a three-way hgation reaction (Figure 3, 
step 5b). The recovered pKS/Lama/APPA plasmid was digested with RsrII, Smal and 
5 inserted back into Lama2 (Figure 3, step 6b). 

Even though both constructs were successfully produced we decided to use 
Lama2/APPAsignal/APPA for the generation of transgenic mice, because we have results 
from our previous transgenic constructs R15/APPA and R15/APPA+intron which 
demonstrated that phytase with optimized Kozak sequence and the APPA signal peptide was 
1 0 synthesized at a high level in salivary glands after induction and was efficiently exported into 
the salivary duct. The Lama2/APP A vector was digested with Xhol and NotI, and the 
transgene was gel-purified and further purified using a Qiagen column (Figure 3, step 7a), 

2) Sequeace of the Lama2/APPA Construct 

A large segment of the Lama2 construct (Laursen and Hjorth 1997) used for 

1 5 constmction of the Lama2-APPA transgene had not been reported in GenBank prior to our 
research. To ensure that we could more clearly describe the transgene construct, and 
furthermore to avoid the introduction of deleterious DNA sequences from the mouse into the 
pig in the process of generating transgenic pigs, we sequenced the Lama2-APPA plasmid on 
both strands. Figure 4 illustrates schematically the structure of the Lama2-APPA plasmid. 

20 Figure 5 illustrates the nucleic acid sequence (SEQ ID NO: 1) of such plasmid. The fixll 
transgene sequence was reconstructed fi^om overlapping DNA sequences using the Contig 
Assembly Program (CAP) (http ://hercules. ti gem.it/ AS SEMBLY/assemble. htmO developed 
by Huang ( 1996; 1999) and then inspected manually for sequencing errors. The transgene 
sequence was checked for the presence of interspersed repetitive elements using the computer 

25 program RepeatMasker (Smith and Green, RepeatMasker at 

http://ftp.genome.washington.edu/cgi-bin/RepeatMasker). It was found that 26 % of the 
transgene sequence was composed of repetitive elements (Table 2). However, such repetitive 
elements are widely present in all mammalian genomes. For example, up to 50% of the 
human genome is derived from repetitive elements (Smit 1996; Kazazian 1998). 

30 Figure 23 illustrates the nucleic acid sequence (SEQ ID NO: 7) of the Lama2/APPA 

transgene construct. 

The Lama2 high level expression cassette (Laursen and Hjorth 1997) contains the 
enhancer region and the promoter of the Psp gene in the parotid gland. High expression was 
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shown to be dependent on regulatory elements bet^'een —1 1.5 kb and --6.5 kb and/or between 
+8.3 kb and +10.9 kb. Svendsen et al, ( 1998a) showed that a 1.5 kb sequence between -3.1 
kb and —4.6 kb had properties of a parotid and sublingual specific enhancer and was 
designated as the PSP proximal enhancer. Furthermore, they showed that transgenes 
5 containing the PSP promoter and 5' flarJdng region located between -3.6 kb and -4.3 kb 
contained sequence information necessary to direct salivary gland specific expression. 

Screening the transgene with RepeatMasker did not reveal the presence of any full- 
length active autonomous elements. The repeats present were extensively modified by 
insertions and deletions. The blastx program was also used to compare the transgene 

10 sequence translated in all reading frames against the National Center for Biotechnology 
Information (NCBI) protein sequence database rhttp://www.ncbi.nlm.nih.gov/BLAST/) 
(Altschul et al 1990;Gish and States 1993;Terada andNakanuma 1993). A region of DNA 
from 861 to 2180 was found that might code for parts of a protein with limited homology 
(38-58% identities) to the C-terminus of several human and mouse reverse transcriptases. 

15 However, the region was extensively modified by mutations with multiple fi^me shifts and 
inversions, and probably represented renmants left from the reverse transcriptase gene of a 
LINE element It is unlikely that it would be active, due to extensive modifications in the 
amino acid sequence such that only 18% of the full reverse transcriptase sequence was 
present and the highly conserved amino acid motif (Y/FXDD) was absent from the sequence 

20 (Xiong and Eickbush 1990). The complete sequence was also scanned for the presence of 
open reading frames (ORFs) that code for proteins using the program GENSCAN 
f http://CCR-08 1 .mit.edu/GENSCAN.htmn (Burge and Karlin 1997). Only one gene was 
found and it corresponded to the APPA phytase gene. GENSCAN unexpectedly predicted a 
different N-terminus for the phytase than would have been expected from the sequence. 

25 However, that could have resulted from the lower accuracy of GENSCAN for detecting 
initiation sites (Burge and Karlin 1998). 

3) Generation of Transgenic Mice Expressing a Constitutive Salivary Phytase 

In the following description, a pair of founder mice, incorporating the phytase gene 
and a constitutive promoter, were prepared under contract by the University of Alabama. As 
30 will be discussed, these founders were used to produce offspring, which were then analyzed 
for the presence of the phytase gene by PCR and animals containing the gene were then 
tested constitutive salivarj' phytase production. 
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Two transgenic founder mice (a black male and a white female, 3-1) containing the 
phytase transgene were received from the NICHD Transgenic Mouse Development Facility 
at the University of Alabama. The black male was negative for salivary phytase, but the 
female, 3-1, exhibited a salivary phytase activity of 30 U/ml. Progeny produced by crossing 
5 the black male with 4 CD-I females produced 9 out of 25 females and 13 out of 26 males that 
were PGR positive. All progeny were negative for salivary phytase. The female founder, 3- 
1, was out-crossed with a CD-I male to produce 3 litters for a total of 35 offspring. Of the 
progeny from these matings one phytase positive Gl male was obtained. When the Gl male 
was outcrossed with 6 CD-I females, of the 6 litters 20/34 males were PGR positive and 
10 salivary phytase positive and 21/28 females were PGR positive and salivctry phytase positive 
(Table 3). The salivary phytase activity of different offspring from the same first generation 
(Gl) male ranged from 1 .3 to 7 1 .2 U/mL There was no significant difference in the phytase 
activities between male or female mice. 

PGR assays for identification of the transgenic mice were carried out with an initial 
15 heating step at 95''C for 3 min, 40 cycles using 95''C for 30 sec, 54'*C for 30 sec and 72''C for 
1 min) using the following primers: APPA-UP2 and APPA-KPN (Figure 6). 

The phytase assays were conducted as described above for the R15-PRP/APPA 
phytase expressed in cell culture. 

20 4) Production of Transgenic Pigs Containing the Phytase Transgene Lama 2/APPA 

Transgenic pigs were produced using Yorkshire and Yorkshire/Landrace cross gilts as 
the embryo donors and Yorkshire sows as flie recipients. The experimental procedure used 
was similar to that described by Wall et al. ( 1985). The detailed procedure is described 
below. The Lama2/APPA construct v^th the APPA signal peptide was used as the transgene 

25 for microinjection. 

Methodology for the generation of transgenic pigs 

The following is a description of the preferred method of generating transgenic pigs 
according to the invention. However, it will be apparent to those skilled in the art that 
various other methods are also applicable. 

30 

a) Superovulation of prepuberal gilts and sows. 

Selected Yorkshire or Yorkshire/Landrace cross gilts between 70 to 80 kg were 
superovulated by intramuscular injectioirof 2000 lU of pregnant mare's serum gonadotropin 
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(PMSG, Ayerst Veterinary Laboratories), followed by 700 lU human chorionic gonadotropin 
(HCG, Ayerst Veterinary Laboratories) 60 to 72 hours later, administered in the same 
manner. The gilts were artificially inseminated three times with a 16 hour interval between 
inseminations using semen from a high breeding index Yorkshire boar. Twenty-four hours 
5 after the last insemination, the gilts were slaughtered and the reproductive tract recovered. 

h) Synchronization of estrus in recipients 

Estms was synchronized in experienced recipient sows as described for donor sows. 
Since synchronization and not superovulation was the goal, hormone levels were reduced to 
10 500 lU for PSMG and 500 lU for HCG. PMSG was given the day the sow's litter was 

weaned, followed in 72 hours by HCG and surgery for embryo transfer was performed 54 
hours thereafter, 

c) Embryo collection 

15 Reproductive tracts were collected at the abattoir, inserted into bags, sealed and the 

bags inmiersed in water at 39^C for transport to the laboratory. Recovery of the embryos and 
microinjection with the transgene was conducted in a laboratory maintained at 32 to 33°C. 
The oviducts were dissected from the tracts and flushed, using a syringe and a feeding tube, 
with 15 ml of pre-warmed HBECM-3 medium (Dobrinsky et al 1996). The media was 

20 collected in a 100 mm Petri dish and placed in an incubator at 38.5°C with an atmosphere of 
5% (vol/vol) of CO2, 5% (vol/vol) O2 and the balance N2. After all tracts were flushed, 
embryos were individually collected from the flushed media using a polished transfer pipette. 
Embryos were rinsed twice in 3 ml volumes of pre-incubated BECM-3 and placed in 100 p.1 
of pre-incubated BECM-3 under 3 ml of filter sterilized mineral oil until injected. 

25 

d) Pronuclear injection 

Embryos from one gilt were collected and placed in one ml of pre-warmed HBECM-3 
in a 1.5 ml centrifuge tube and centrifiiged for 6 min at 14,000 x g (Wall et al 1985). The 
embryos were then collected and placed in an injection dish with 40 \iX of pre-warmed 
30 HBECM-3 covered with 2.5 ml of filter sterilized mineral oil. The pronucleus in each 
embryo was injected (Gordon et al 1980) with three picolitres of Lama2/APPA DNA in 
solution at a concentration of 5 ng of DNA per p.1 in 10 mM Tris, pH 7.5, 0.1 mM EDTA. 
After injection, the embryos were placed dishes containing 100 \i\ of pre-incubated 
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BECM-3 under 3 ml of filter sterilized mineral oil. After all embryos were injected, which 
took no more than 4 hours since collection of reproductive tracts, the embryos were 
transferred to 1.8 ml cryotube (Nunc) containing 1 ml of pre- warmed HBECM-3 and 
transported in an incubator at 38.5°C to the swine surgery. 

5 

e) Embryo transfer 

Recipient sows were anesthetized by intravenous injection of 500 mg Brietol and 
anesthesia maintained by inhalation of 3% halothane with 4 litres per min of nitrous oxide 
and 4 litres per min oxygen. The oviducts were exposed through a laparotomy, just off the 
10 dorsal midline, and a catheter, containing 20 to 35 injected embryos and 3 to 6 untreated 

embryos, was passed into the infimdibulum and down the oviduct to the isthmus and emptied. 
The oviduct was returned to the abdominal cavity and the incision closed. 

f) Growth of pigs 

15 New-bora piglets were kept together imtil weaning. At that time males and females 

were separated and penned with non-transgenic same sex pigs of a similar age from other 
litters. The pigs are fed ad libitum starter rations until 25 kg wt , grower diet from 25 to 60 
kg wt and finisher diet from 60 kg to market weight. Water is available ad libitum. 
Transgenic pigs 167-02, 282-02 and 282-04 were maintained on a low phytate ration until 85, 

20 50, and 50 days of age, respectively, and then switched to the grower ration. All other 
transgenic pigs were given the standard high phosphorus diets. 

The diets were provided as pelleted formulations during the weanling, grower and 
finishing phases are shown in Tables 4 and 5. The vitamin and mineral mixes included in the 
diets are shown in Tables 6 and 7. 

25 

PCR analysis 

Tail segments from newborn piglets were collected and slices of each placed in 600 \i\ 
of 50 mM NaOH and heating at for 95°C for 15 minutes. The suspension was neutralized 
with 50 jil of 1 M Tris (pH 8.0) and insoluble materials removed by centrifugation for 5 min 
30 in a microcentrifuge. A 2 jj.1 sample of each was used for PCR with primers APPA«UP2 and 
APPA-KPN. 

The primers produce a 750 bp fragment if the transgene is present. As a positive 
control PIG-BGF and PIG-BGR primers were used to detect the porcine p-globin gene from 
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the same DNA preparation (Heneine and Switzer 1996), The PCR reaction was performed 
using the same conditions as described for detection of the phytase transgene. As a negative 
control genomic DNA from a non-transgenic pig was used in the PCR reaction, for a positive 
control this DNA was spiked with a known amount of transgene (1 gene copy/per genome). 
5 When a positive signal was identified by PCR for pig 167-02 (Figure 3) another DNA 

preparation was made and two more pairs of PCR primers were used to test for gene integrity 
(Figure 4) APPA-MATURE with APP A-KPN, and APP A-MATURE with APPA-DOWN2 
PCR conditions were similar to those described previously. 

10 Extraction of DNA from blood for PCR analysis 

The method for extraction of DNA from blood was based on a method described by 
Higuchi ( 1989) with some modifications. A 100 ^il volume of whole blood was mixed with 
200 111 of lysis buffer (10 mM Tris-HCl, 0.32 M sucrose, 5 mM MgCh, 1% (vol/vol) Triton 
X-100, pH 7.5.), mixed briefly and incubated on ice for 5 min. The sample was then 

15 centrifiiged at 14,000 x G for 3 min, and the superaate discarded. The sediment was 

suspended in lysis buffer, mixed, incubated and centrifiiged. This procedure was repeated 2 
more times, or until no hemoglobin remained. The sediment was dissociated in 100 p.1 of 50 
mM NaOH, mixed and heated at lOO^'C for 10 min. The contents were cooled, 10 fil of 1 M 
Tris-HCl (pH 8,5) added and mixed briefly. The sample was then centrifiiged at 14,000 x g 

20 for 2 min and 2 (il of the supemate used for analysis by PCR. 

The PCR reaction mixture with a total volume of 40 |il consisted of; 23.8 |il of 
distilled water, 4 >xl of 10 X Gibco BRL PCR buffer, 1.2 \il of 50 mM MgCb, 0.8 ^il of 10 
mM dNTPs, 40 pmol of each of the forward and reverse primers in 8 ^il, 2 \i\ of template 
DNA and 0.2 fxl of Tag DNA polymerase (Gibco BRL, 5 V/yl). The amplification procedure 

25 was performed with an initial heating step at 95°C for 3 min followed by 40 cycles of 95^C 
for 30 sec, 54*'C for 30 sec and 72'*C for 60 sec. 

The transgenic pigs were detected with primers for the APPA gene (APP A-KPN with 
APPA-UP2), and as a control PIG-BGF with PIG-BGR primers were used for detection of 
the porcine (3-globin gene. 

30 

Saliva collection from piss for ohvtase assays and weighing of pigs 

Weanling pigs were sampled for salivary phytase by wiping under the tongue with a 
cotton tipped applicator, breaking the stick offand centrifuging the applicator tip in a 0.4 ml 
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microcentrifuge tube, with a hole in the bottom, contained within a 1.5 ml microcentrifuge 
tube. Grower and finishing pigs were sampled using 1 .5 inch long #2 dental cotton absorbent 
rolls (Ash Temple Sundries Ltd, Don Mills, ON) attached to dental floss. These were 
centrifuged in 1.5 ml microcentrifuge tubes with holes in the bottom while contained in larger 
5 tubes. The saliva was collected from the larger tube and stored at -20^C until analyzed. 
Saliva was collected and pigs were weighed at weekly intervals. 

Analysis for phvtase activity. 

Saliva samples were either assayed directly or afier dilution in 0.1 M acetate buffer 

10 pH 4.5, Phytase was assayed in 200 jal of 0.1 M sodium acetate buffer (pH 4.5) using sodium 
phytate (4 mM) as a substrate at 37^C. After 10 min of incubation the reaction was stopped 
by addition of 133 ^1 ammonium molybdate/ammonium vanadate/nitric acid mixture and the 
concentration of liberated inorganic phosphate determined at 405 nm (Engelen, van der 
Heeft, Randsdorp, and Smit 1994). This and all other assays were performed in triplicate. 

15 One unit (U) of enzyme activity was the amount of the enzyme releasing 1 ^rniol of inorganic 
phosphate per minute. 

Assays for salivary phytase and for phytase in blood samples were conducted as 
previously described for saliva samples. A reagent blank with blood added at the same 
concentration as the samples assayed was subtracted from the sample readings. 

20 

Collection of fecal materials and analysis for total phosphorus 

Fresh feces were collected from each pig during the grower and finisher phases. 
Samples were placed in aluminum trays closed with a wax paper top and immediately frozen, 
and kept frozen until they were lyophilized for analysis. After lyophilization the samples 

25 were transferred to room conditions overnight to reach equilibrium in moisture content. The 
samples were separately ground with a mortar and pestle until homogenous and sealed in 
plastic containers until analyzed further. Dry matter content of samples was analyzed 
according to AO AC (Association of Official Analytical Chemists (AO AC) 1 984) by heating 
1 gram samples at 1 10*^C for 4 hours and cooling in a desiccator prior to weighing. To 

30 analyze total phosphorus content, samples were heated at 550°C in a muffle fiimace and 10 
ml of 10 M HCl added and heated to boiling. The contents from each sample was 
quantitatively diluted to 250 ml with water and inorganic phosphorus content was measured 
by the method of Heinoen and Lahti ( 1981). 
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Purification of the E, coli produced phvtase and pig salivary phvtase 

The APPA phytase was over expressed in E. coli strain BL21(DE3) and the EDTA 
lysozyme extract fraction purified on DEAE-Sepharose and Sephadex-G75 as described by 
5 Jia et aL ( 1998). The pig phytase was purified by chromatography on DEAE-Sepharose and 
the band of enzyme eluted with a sodium chloride gradient was further purified by 
Chromatofocusing using a pH gradient from pH 4.0 to 7.0. 



SDS-PAGE analysis and Silver Staining 
10 Sodium dodecylsulfate polyacrylamide gel electrophoresis was performed using a 

10% gel as described by Laenunli ( 1970), except that protein in the sample buffer was 
heated at 70°C for 10 minutes. Samples were stained with silver as described by Nesterenko 
et al. ( 1994). 



15 Preparation of a monoclonal antibody specific for the APPA encoded E, coli phytase 

Monoclonal antibodies specific to the £. coli APPA encoded phytase were prepared 
according to the procedures of Galfre and Milstein (1981). Briefly, two female Balb/c mice 
were immimized 7 times over a period of 59 days with a purified APPA enzyme preparation. 
Mouse spleens were harvested, and the cells therein fused with an NS-1 myeloma cell line 

20 (Kohler and Milstein, 1976). Fused cells were selected for their ability to grow in media 

containing hypoxanthine, aminopterin, and thymidine (HAT). Western blotting and Enzyme- 
Linked Immunosorbent Assays (ELISA) were used identify those clones capable of secreting 
an antibody into the culture medium that recognized epitopes on both the E. coli and pig 
derived APPA enzyme. Clones secreting a desirable antibody were subcloned twice to 

25 ensure a pure culture of antibody secreting hybridomas. 

Production of Polyclonal Antibodies Against the Purified E. coli derived APPA Phvtase 

Antibodies were prepared in two New Zealand White Rabbits by two intramuscular 
injections at different sites in the thigh of 50 |Jtg of purified Escherichia coli derived APPA 
30 phytase in 0.5 ml of a 1 :1 mixture of phosphate-buffered saline (PBS) and Freund's Complete 
Adjuvant. This was followed by repeat injections of 20 fig each of phytase in a 1 :1 mixture 
of PBS and Freund's Incomplete Adjuvant on days 4, 19, 25, and 39. Blood was collected 
via heart puncture on day 42. The serum was separated from the cell fraction and used as the 
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source of antibodies. The basic procedures for antibody production are described in Harlow 
and Lane (1988). 



Western blotting 

5 Western blotting was performed as described by Towbin et al. (Towbin et al 1979). 

Deglycosylation of pig phytase was done according to protocols, Roche Molecular 
Biochemicals, with following modifications. Protein in 50 mM Tris (pH 8.0), 1 mM EDTA, 
1% SDS, 1% 2-mercaptoethanol was denaturated by heating at 95** C for 3 min. Than protein 
was precipitated with chloroform-methanol method (Wessel and Flugge 1984) and 
10 resuspended at 100 iig/nxL in 20 mM Sodium Phosphate (pH 7.2) with 1% Triton X-100. 

Complete deglycosylation of 5 jig in 50 |iL phytase was carried out overnight at 37°C using 
1 unit (U) N-glycosidase F, 1.2 mU O- glycosidase and 1 mU neuraminidase (Boehringer 
Mannheim GmbH). After incubation 0.5 |xg of protein was run on the SDS gel. 



15 Staining of glycoproteins 

This staining was done using DIG Glycan Detection Kit (Boehringer Mannheim) 
according to manufacture instructions (O'Shannessy et al. 1987). 

Statistics on the generation of transgenic pigs 

The statistics on embryos recovered, microinjected and transferred into donor sows is 
20 shown in Table 8. A total of 4147 embryos injected with the transgene and 675 untreated 

embryos were introduced into 140 recipient sows with an average of 30 injected embryos and 
5 uninjected embryos. All offspring were tested for the presence of the transgene in tissue 
biopsy, in blood by PGR analysis, and by an assay for phytase activity in the saliva. 

Table 9 lists the transgenic pigs that were produced, their birth dates, sex and salivary 
25 phytase levels. There were 3 1 pigs transgenic for the phytase gene out of 203 live piglets 

bom from embryos microinjected. These were detected by the presence of the gene in blood 
samples using the standard primer set, APPA-UP2 and APPA -KPN, but only 14 were 
detected by analysis of tail DNA preparations using the standard primer set. When the 
negative samples were reanalyzed using the primer set LAMA-UPl and APPA-<iown4 
• 30 (Figure 8) a further 8 tail DNA samples were found to be positive. Purification of the tail 
biopsy DNA probably would have led to all being PGR positive for the phytase transgene. 
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Characteristics of the phvtase transgene in transgenic pig 167-02 

The application of PCR to detection of transgenic pigs is exemplified by analysis of 
litter 167 in which one of 7 piglets tested, including one that was stillborn and one that was 
crushed by the sow after birth, one live piglet designated 167-02 was identified as positive for 
the APPA gene by generation of a PCR product (Lane 2) of approximately 750 bps from the 
tail chromosomal DNA (Figure 7). No rearrangements of the APPA gene were detected as 
documented by the positive PCR results using primers directed to the 3' region (lane 2) the 
whole gene (lane 3) and the 5* region (lane 4) of the APPA gene (Figure 8). 

Salivary phvtase and weight gain during growth of transgenic and non-transgenic penmates. 

Data on salivary phytase activity and weight gain are shown for five transgenic pigs 
and for weight gains of their non-transgenic penmates in Figures 9, 10, 11, 12 and 13. The 
phytase activity in the saliva varied substantially fi:om one sampling time to the next. This 
variability was attributed to a combination of environmental factors including whether the 
animal had just consumed food or water, and regulation of parotid and saliva secretion in 
relation to food and water consumption. The weight gains during grov^ of the five 
transgenic pigs was within the range of the weight gains of the normal non-transgenic pigs. 

With the exception of 167-02 the growth rate of the transgenic pigs was similar to that 
of the non-transgenic litter mates. 

Phosphorus content in the fecal materials from transgenic and non-transgenic pigs. 

The phosphorus content of fresh fecal samples from three of the transgenic foimder 
pigs, 167-02, 282-02, 282-04, 405-02 and 421-06 receiving weaning, grower or finisher 
ration is shown in Table 9. The phosphorus content of the feces of the transgenic pigs ranged 
from 1.59 to 2.26% while that of the non-transgenic penmates ranged from 1.61 to 2.76 %. 
The reduction in fecal phosphorus ranged from a maximum of 26% to a minimum of 8%. In 
most cases the differences were at the 99% level of significance. The ages of the pigs at the 
time of fecal sampling and the corresponding phytase activities are shown in Figures 9, 10, 
1 1, 12 & 13. The rations fed contained a supplement of readily available phosphorus suitable 
for maximizing growth of non-transgenic pigs. Since the reduction in fecal phosphorus is 
measured in transgenic pigs receiving a diet high in mineral phosphorus it is very likely that 
the fecal phosphorus would be substantially lower if the diet lacked mineral phosphorus. 
Under these conditions the phosphorus released from phytate would provide a substantial 
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proportion of the dietary phosphorus and little would reach the large intestine and be excreted 
in the feces. 

Transmission of the phvtase transeene (to be completed) 
5 When semen from the transgenic boar 167-02 was used to inseminate four Yorkshire 

gilts all four sows had litters in which 4 out of 8, 2 out of 9, 7 out of 8 and 2 out of 5 of the 
piglets were transgenic for the phytase gene (Table 11). The PGR data for litter 154 that 
documents the presence of the transgene is shown in Figure 14. All pigs containing the gene 
exhibited phytase activity in the saliva, and it ranged from 341 to 10,077 units per ml. Half 
10 of the transgenic piglets had salivary ph3^e activities of greater than 2000 units per ml. The 
specific activity of the phytase in the saliva ranged from 39 U/mg protein to a high of 706 
units/mg protein. 

This data documents that the gene was transferred and that the level of phytase 
expression observed in the founder was preserved in the first generation of pigs. Both male 
15 and female pigs at 1 1 days of age exhibited high phytase activity. 



Characteristics of the phvtase enzyme synthesized in the salivary glands of the pig 

The phytase enzyme was purified to homogeneity from coli and from saliva 
collected from transgenic pig 167-02. Silver stains of the purified enzymes after SDS-PAGE 

20 are shown in Figure. 15. The E. coli derived enzyme has a molecular mass of approximately 
45 kDa while that produced by the pig was about 55 kDa. The enzymes were also 
electrophoresed as before, transferred to nitrocellulose and stained for glycoproteins. The 
second part of Figure 15 shows that the pig APPA protein is glycosylated. Figure 15B shows 
that treatment of the pig phytase with deglycosylation enzymes changes the size of the 

25 phytase from 60 kDa to 45 kDa, an observation that confimis the glycosylated nature of the 
recombinant phj^ase produced in the saliva of the pig. 

The data in Figure 16 shows that the pig phytase is homologous with the E. Coli 
enzyme despite their difference in size. 

The purified pig phytase had Km and Vmax values of 0.33 mM and 624 units per mg of 

30 protein, respectively, Golovan et al. (2000) previously reported the Km and V„ax for the E, 
coli enzyme to be 0.63 mM and 2325 units per mg of protein. Thus the salivary phytase 
exhibits approximately 25% of the activity of the E, coli enzyme. This reduction in activity 
may be due to glycosylation that either modifies the catalytic site of the enzyme or otherwise 
leads to the formation of an enzyme with lower catal>tic activity. 
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The latter finding of the production of a glycosylated protein suggests a methoS'of 
producing such proteins using transgenic animals. Currently, although recombinant methods 
are available for producing proteins in host cells, it is often found that the mature peptide 
lacks the glycosylation normally associated with proteins produced by higher life forms. 
5 Insulin is an example of such protein. The fmdings of this study suggest that one means of 
producing the desired glycoproteins would be to generate transgenic animals such as the pig, 
that have been transformed, by known methods or the method described above, with a gene 
encoding the desired protein. When expressed by such animal, the subject protein would be 
produced and would undergo post-translational processing in the cell including the step of 

10 glycosylation. Thus, the invention contemplates a general method of producing such 
glycosylated proteins. Further, the invention contemplates a method of producing 
glycosylated proteins through the expression in and isolation fi'om the saliva of an animal that 
has been transformed with a gene encoding such protein, and wherein such gene is operably 
linked to a saliva protein promoter or enhancer. 

15 Various methods are known in the art for the collection of glycoproteins fix>m the 

parotid gland of the pig for various applications. For example, surgical techniques have been 
published by Deimy et al. (1972) for the collection of secretions firom the parotid gland and 
submandibular salivary ducts. 

20 Test kit for detection of the APPA phvtase protein in pigs 

The monoclonal antibodies produced against the APPA phytase expressed in E, coli 
reacted with the APPA phytases produced in the saliva of transgenic mice and pigs (Figure 
1 7). Immunological detection of phytase in saliva provides definitive proof that the phytase 
secreted in transgenic pig saliva is a product of the APPA gene expressed in the pig salivary 
25 gland. This serves as a reliable method to document phytase production in transgenic pigs. 

A fiuther test would also be obtainable using the polyclonal antibodies discussed 

above. 

The DNA sequence encoding phytase may be obtained from a variety of sources such 
30 as microbial, plant or animal sources. Preferably, the DNA sequence is obtained from a 
microbial source such as bacteria. Most preferred DNA sequences are obtained from 
Escherichia coli. 

The cloning of a gene or a cDNA encoding a phytase protein may be achieved using 
various methods. One method is by purification of the phytase protein, subsequent 
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determination of the N-tenninal and several internal amino acid sequences and screening of a. 
genomic or cDNA library of the organism producing the phytase using oligonucleotide 
probes based on the amino acid sequences. If at least a partial sequence of the gene is known, 
this information may be used to clone the corresponding cDNA using, for instance, the 
5 polymerase chain reaction (PGR) (PGR Technology: Principles and Applications for DNA 
Amplification, (1989) H. A. Ehrlich, ed., Stockton Press, New York; the contents of which 
are incorporated herein by reference). It will be evident to those skilled in the art that the 
cloned phytase gene described above may be used in heterologous hybridization experiments, 
directed to the isolation of phytase encoding genes from other microorganisms. 

1 0 The DNAs encoding ph3^ase or individual fragments or modified proteins thereof can 

be fused, in proper reading frame, with appropriate regulatory signals as described in detail 
below, to produce a genetic construct that is then amplified, for example, by preparation in a 
bacterial (e.g., E, coli) plasmid vector according to conventional methods. Such methods are 
described in, for example, Sambrook et al.. Molecular Gloning: A Laboratory Manual (Cold 

15 Spring Harbor Press 1989), the contents of which are incorporated herein by reference. The 
amplified construct is thereafter excised from the vector and purified for use in producing 
transgenic animals. 

The desired protein may also be produced as a fiision protein containing another 
protein. For example, the desired recombinant protein of this invention may be produced as 

20 part of a larger recombinant protein in order to stabilize the desired protein. Useful 
modifications within this context include, but are not Umited to, those that alter post- 
translational modifications, size or active site, or that fiise the protein or portions thereof to 
another protein. Such modifications can be introduced into the protein by techniques well 
known in this art, such as by synthesizing modified genes by ligation of overlapping 

25 oligonucleotides or introducing mutations into the cloned genes by, for example, 
oligonucleotide-mediated mutagenesis. 

The cloned phytase gene may be used as starting materials for the construction of 
improved phytases. Improved phytases are phytases, altered by mutagenesis techniques (e.g. 
site-directed mutagenesis, or directed evolution), which have properties that differ from those 

30 of wild-type phytases (Kuchner and Arnold 1997). For example, the temperature or pH 
optimum, specific activity, temperature or protease resistance may be altered so as to be 
better suited for a particular application. 

A choice of expression in cellular compartments (such as c)^osol, endoplasmic 
reticulum) or extracellular expression can be used in the present invention, depending on the 
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biophysical and biochemical properties of the phytase. Such properties include, but are not 
limited to pH sensitivity, sensitivity to proteases, and sensitivity to the ionic strength of the 
preferred compartment. The DNA sequence encoding the enzyme of interest should be 
modified in such a way that the enzyme can exert its action at the desired location in the cell. 
5 To achieve extracellular expression of the phytase, the expression construct of the present 
invention utilizes a bacterial signal sequence. Although signal sequences that are 
homologous (native) to the animal host species are preferred, heterologous signal sequences, 
i.e. those originating from other animal species or of microbial origin, may be used as well. 
Such signal sequences are known to those skilled in the art. 

10 All parts of the relevant DNA constructs (promoters, regulatory, secretory, stabilizing, 

targeting, or termination sequences) of the present invention may be modified, if desired, to 
affect their control characteristics using methods known to those skilled in the art. The cis- 
acting regulatory regions useful in the invention include the promoter that drives expression 
of the phytase gene. Highly preferred are promoters that are specifically active in salivary 

15 gland cells. Among such promoters, highly preferred are mouse parotid secretory protein 
(PSP) promoter, rat proline-rich protein (PRP) promoter, human salivary amylase promoter, 
mouse mammary tumor virus promoter (Samuelson 1996). Among the usefiil sequences that 
regulate transcription, in addition to the promoters discussed above, are enhancers, splice 
signals, transcription termination signals, and polyadenylation sites. Particularly useful in 

20 this regard are those that increase the efficiency of the transcription of the genes for phytase 
in the salivary gland or other cells of the transgenic animals listed above. Preferred are 
transcription regulatory sequences for proteins highly expressed in the salivary gland cells. 
Introns could be introduced to increase levels of expression. Such introns include the 
synthetic intron SIS, SV40 small t antigen intron and others (Whitelaw et aL 1991; Petitclerc 

25 e/fl/. 1995). 

Preferably, the expression system or construct of this invention also includes a 3* 
untranslated region downstream of the DNA sequence encoding the desired recombinant 
protein, or the salivary protein gene used for regulation. This region apparently stabilizes the 
RNA transcript of the expression system and thus increases the yield of the desired protein. 
30 Among the 3' untranslated regions useful in this regard are sequences that provide a polyA 
signal. Such sequences may be derived, e.g., from the S V 40 small t antigen late 
polyadenylation signal, s)aithetic polyadenylation signal or other 3' untranslated sequences 
well known in this art (Carswell and Alwine 1989;Levitt et al. 1989). Preferably, the 3' 
untranslated region is derived firom a salivary-specific protein. The stabilizing effect of this 
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region's polyA transcript is important in stabilizing the mRNA of the expression sequence. 
Further, the addition of locus control regions (LCRs), matrix attachment regions (MAR) and 
scaffold attachment regions (SARs) would allow position-independent, copy number 
dependent expression of the transgene with either homologous or heterologous promoters 
5 (Taboit-Dameron et al 1999;Geyer 1997). Co-integration of an actively expressed gene with 
the transgene was also shown to increase expression levels of a poorly expressed transgene 
(Clark et al 1993). Also important in increasing the efficiency of expression of phytase is a 
strong translation initiation site (Kozak 1987). Likewise, sequences that regulate the post- 
translational modification of phytase may be useful in the invention. 

10 The term "animal" as used herein denotes all animals except humans. It also includes 

an individual animal in all stages of development, including embryonic and fetal stages. 

A "transgenic" animal is any animal containing cells that bear genetic information 
received, directly or indirectly, by deliberate genetic manipulation at the subcellular level, 
such as by microinjection or infection with a recombinant virus. "Transgenic" in the present 

15 context does not encompass classical crossbreeding or in vitro fertilization, but rather denotes 
animals in which one or more cells receive a recombinant DNA molecule. Although it is 
highly preferred that this molecule be integrated within the animal's chromosomes, the 
invention also encompasses the use of extrachromosomally replicating DNA sequences, such 
as might be engineered into yeast artificial chromosomes. The information to be introduced 

20 into the animal may be foreign to the species of the animal to which the recipient belongs 
(i.e., "heterologous"), or the information may be foreign only to the particular individual 
recipient, or genetic information already possessed by the recipient. In the last case, the 
introduced gene may be expressed in a manner different than the native gene. 

As indicated above, the transgenic animals of this invention are other than human. 

25 Farm animals (pigs, goats, sheep, cows, horses, rabbits and the like), rodents (such as mice 
and rats), domestic pets (eg. cats and dogs), fish and poultry (eg. chickens) are included in the 
scope of this invention. It is highly preferred that a transgenic animal of the present invention 
be produced by introducing into single cell embryos appropriate polynucleotides that encode 
phytase, or fragments or modified products thereof, in a manner such that these 

30 polynucleotides are stably integrated into the DNA of germ line cells of the mature animal, 
and are inherited in normal mendelian fashion. Advances in technologies for embryo 
micromanipulation now permit introduction of heterologous DNA into fertilized mammalian 
ova. For instance, totipotent or pluripotent stem cells can be transformed by microinjection, 
calcium phosphate mediated precipitation, liposome fusion, retroviral infection or other 
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means, the transformed cells are then introduced into the embryo, and the embryo then 
develops into a transgenic animal. In one preferred method, developing embryos are infected 
with a retrovirus containing the desired DNA, and transgenic animals produced from the 
infected embryo. In a most preferred method, however, the appropriate DNAs are co-injected 
5 into the pronucleus or cytoplasm of embryos, preferably at the single cell stage, and the 

embryos allowed to develop into mature transgenic animals. Such techniques are well known 
(see reviews of standard laboratory procedures for microinjection of heterologous DNAs into 
mammalian fertilized ova, including Hogan et al.. Manipulating The Mouse Embryo, (Cold 
Spring Harbor Press 1986); Krimpenfort et aL, Bio/Technology 9:844 (1991); Pakniter et ah, 

10 Cell, 41 : 343 (1985); Kraemer et al,. Genetic Manipulation Of The Early Mammalian 
Embryo, (Cold Spring Harbor Laboratory Press 1985); Hammer et al.. Nature, 315: 680 
(1985); Wagner et aL, U.S, Pat. No. 5,175,385; Krimpenfort et al., U.S. Pat. No. 5,175,384, 
the respective contents of which are incorporated herein by reference). 

For a person skilled in art, it will also be clear that the present invention provides for 

1 5 other proteins to be expressed in the salivary gland of the pig. Such proteins may be secreted 
into saliva to improve digestion and decrease pollution potential (for example, 
endoglucanases), or specifically targeted for secretion into blood and have effects on the 
growth and health of the animal (such as growth hormone). 

Phytase activity may be measured via a number of assays, the choice of which is not 

20 critic2Ll to the present invention. For example, the phytase enzyme activity of the transgenic 
animal tissue may be tested with an ELISA-assay, Western blotting or direct enzyme assays 
using calorimetric techniques or gel assay system. 

The examples included herein are provided so as to give those of ordinary skill in the 
art a complete disclosure and description of how to make and use the invention and are not 

25 intended to limit the scope of what the inventors regard as their invention. Efforts have been 
made to ensure accuracy with respect to numbers used (e.g., amounts, temperature, pH, etc.) 
but some experimental errors and deviation should be accounted for. Unless indicated 
otherwise, temperature is in degrees Centigrade and pressure is at or near atmospheric. 

30 

Although the invention has been described with reference to certain specific 
embodiments, various modifications thereof will be apparent to those skilled in the art 
without departing from the spirit and scope of the invention as outlined in the claims 
appended hereto. 
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Table 1. Secretion of phvtase in the saliva of transgenic mice containing the R1S-PRP7XPPA 
transgene and non-transgenic mice induced with isoproterenol and pilocarpine. 



Founder Mice PGR Gender Generation Transgene Fhytase activity 

micromoles/min/ml 



AOm 


4bfr (+) 


positive 


F 


I 


APPA+intron 


39.73 


AOm 


2brm(+) 


positive 


M 


1 


APPA+mtron 


24.29 


AOm 


2brai(+) 


positive 


M 


2 


APPA+intron 


14.42 


AOm 


5brf(+) 


positive 


F 


2 


APPA+intron 


7.36 


AOm 


IbrmC-) 


negative 


M 


1 


APPA+intron 


0.00 


Alf 


9brf(+) 


positive 


F 


1 


APPA+intron 


0.08 


Alf 


1 Iw f(+) 


positive 


F 


1 


APPA+introii 


0.07 


Alf 


5bnn(+) 


positive 


M 


1 


APPA+intron 


0.03 


Alf 


10wf{-) 


negative 


F 


1 


APPA+intron 


0.02 


A20f 


lbnn(+) 


positive 


M 


1 


APPA+intron 


0.53 


A20f 


5brf(+) 


positive 


F 


1 


APPA+intron 


0.12 


A20f 


4brf (-) 


negative 


F 


1 


APPA+intron 


0.03 


A2m 


I3wf(+) 


positive 


F 


1 


APPA+intron 


87.70 


BOm 


5brf(+) 


positive 


F 


1 


APPA+intron 


0.95 


BOm 


3brm(+) 


positive 


M 


1 


APPA+intron 


0.73 


BOm 


6wf(-) 


negative 


F 


1 


APPA+intron 


0.00 


BOf 


3wf(+) 


positive 


F 


2 


APPA 


252.43 


BOm-intr 


9wf(+) 


positive 


F 


1 


APPA 


546.74 


WOm 


8wf(+) 


positive 


F 


1 


APPA 


60.42 


W30m 


lwm(+) 


positive 


M 


2 


APPA 


41.91 


W30m 


llwf(+) 


positive 


F 


1 


APPA 


43.44 


W30m 


4wm(-) 


negative 


M 


1 


APPA 


0.02 


W30m 


lOwf(-) 


negative 


F 


I 


APPA 


0.02 
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Table 2. Repeat sequences found in the Lama2-APPA consUuwt. 



Start 


End 


DNA 
strand 


Repeat 


Class/femily 


Substitu- 
tions % of 
consensus 


Deletions 

%of 
consensus 


Insertions 

%of 
consensus 


765 


927 


+ 


LIMl 


LINE/Ll 


25 


4.2 


6.7 


928 


965 


+ 


(CA)n 


Simple repeat 


0 


0 


0 


966 


1020 


+ 


LlMl 


LINE/Ll 


25 


4.2 


6.7 


1021 


1156 


+ 


Bl MM 


SINE/Alu 


15.4 


0 


0 


1159 


1231 


+ 


CAAAC)n 


Simple repeat 


1.4 


0 


0 


1232 


1385 


+ 


LIMl 


LINE/Ll 


25 


4.2 


6.7 


1652 


2308 


C 


LI 


LINE/Ll 


28.5 


11.9 


1.7 


2334 


2406 


c 


MIR 


SINE/MIR 


27.4 


4.1 


0 


2415 


3266 


+ 


RMER13A 


LTR 


17.7 


4 


6.1 


6016 


6127 


c 


L1MA9 


LINE/Ll 


25.5 


2 


1 


6831 


7007 


+ 


CT-rich 


Low 
complexity 


30.5 


1.7 


3.4 


7299 


7510 


c 


B3 


SINE/B2 


27.8 


7.5 


1.4 


7718 


7746 


+ 


(TCTCTG)n 


Simple repeat 


6.9 


0 


0 


8499 


8581 


c 


MIR 


SINE/MIR 


24.1 


12.1 


3.6 


9010 


9603 


+ 


Lx4 


LINE/Ll 


21.7 


6.4 


0.2 


10465 


10519 


+ 


(TG)n 


Simple repeat 


5.5 


1.8 


0 


11235 


11287 


c 


MER5A 


DNA/MERl 
type 


28.3 


0 


1.9 


12372 


12537 


c 


L1MA4A 


LINE/Ll 


28.3 


5.4 


0 


14240 


14388 


+ 


B1_MM 


SINE/Alu 


4 


0 


1.3 


14869 


14945 


c 


MIR 


SINE/MIR 


36.4 


1.3 


0 


16391 


16540 


c 


ORRID 


LTIi/MaLR 


29.3 


0 


6 


16774 


17214 


+ 


RMER4 


LTR 


21.3 


10 


11.8 


17229 


17718 


C LI MM 


LINE/Ll 


15.3 


0 


0.8 
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Table 3, Salivary phvtase activities of G2 mice from the founder female 3-1 
generated usine the construct Lama2-APPA. The mice were between 21 and 30 
days of age. 



male mouse # 


Phvtase (U/ml) 


female mouse # 


Phvtase (U/ml) 


5 


28.3 


1 


9.0 


6 


2.5 


2 


29.9 


8 


6.6 


4 


8.0 


9 


44.7 


5 


43.0 


10 


12.7 


6 


26.9 


12 


28.3 


8 


1.9 


15 


28.1 


9 


66.3 


18 


71.2 


10 


19.9 


19 


19.5 


11 


61.3 


20 


15.7 


12 


36.4 


21 


20.9 


13 


18.0 


22 


4.1 


17 


38.9 


24 


13.0 


18 


18.5 


26 


53.4 


19 


27.0 


28 


20.4 


23 


6.5 


29 


34.1 


24 


16.1 


30 


11.1 


25 


9.4 


32 


3.1 




26 


14.8 


33 


51.7 


27 


1.3 


34 


19.0 


1 28 


8.2 
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Table 4. Composition and nutrient levels of Phase II starter diet and low phvtate 
starter diets fed to weanling pies between S-10 kg. 



ingrecLients 


Diet/Nutrient Levels^ 




Phase II Starter Diet 


Low Phytate Starter Diet 


Com 


33.15 


25.44 


Barley 


8-00 


8.00 


Wheat 


20.00 


40.00 


Soybean meal 


2 LOO 


8.00 


Fish meal 


5.00 


5.00 


Meat and bone meal 


- 


1.00 


Whey 


8.00 


8.00 


Fat 


2.00 


2.00 


Lysme-HCi 


0.10 


0.28 


Dicalcium phosphate 


1.10 




CaCOa 


0.90 


1.10 


Iodized salt 


0.30 


0.30 


Vitamm premix 


0.250 


0.55 


Mineral premix 


0.10 


0.10 


Linconunix 44 


0.10 


0.10 


Total (kg) 


100.00 


100.00 








Calculated nutritive values 






DE (kcal/g) 


3.44 


3.36 


CP (%) 


19.46 


18.62 


Ca (%) 


1.00 


0.94 


Total P (%) 


0.74 


0.66 


/TPS 

Ca/P 


1.35:1 


1.42:1 


Total AA contents (%) 






Argmine 


1.16 


1.17 


Histidme 


0.50 


0.48 


ISO leucine 


0.81 


0.77 


Leucine 


1.58 


1.54 


Lysine 


1.17 


1.06 


Methionine 


0.34 


0.29 


Cysteine 


0.34 


0.34 


Methionine-hCysteine 


0.68 


0.63 


Phenylalanine 


0.90 


0.90 


Tyrosine 


0.65 


0.65 


Threonine 


0.75 


0.68 


Tryptophan 


0.23 


0.23 


Valine 


0.91 


0.86 



Minerals and vitamins meet or exceed levels recommended by NRC (1998). 
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Table 5. Composition and nutrient levels of grower and finisher diets. 



Ingredients 


Diet/Nutrient Levels 




Grower Diet 
For pigs 20 to 50 kg 


Finishing Diet 
For pigs 50 to 120 kg 


Com 


51.78 


40.00 


Barley 


8.10 


23.03 


Wheat 


20.00 


23.00 


Soybean meal 


16.00 


13.00 


Fat 


1.00 


1.00 


Lysine-HCl 


0.12 


0.12 


Dicalcium phosphate 


1.20 


1.00 


CaC03 


1.15 


1.15 


Iodized salt 


0.50 


0.50 


Vitamin premix 


0.15 


0.15 


Mineral premix^ 


0.10 


0.10 


Total (kg) 


100.00 


100.05 








Calculated nutritive values 






DE (kcaVg) 


3.39 


3.33 


CP (%) 


14.76 


14.17 


Ca (%) 


0.79 


0.74 


Total P (%) 


0.57 


0.53 


Ca/P 


1.39:1 


1.39:1 


Total AA contents (%) 






Arginine 


0.86 


0.80 


Histidine 


0.38 


0.36 


Isoleucine 


0.58 


0.55 


Leucine 


1.28 


1.18 


Lysine 


0.78 


0.73 


Methionine 


0.24 


0.23 


Cysteine 


0.29 


0.29 


Methionine+Cysteine 


0.53 


0.52 


Phenylalanine 


0.70 


0.68 


Tyrosine 


0.50 


0.46 


Threonine 


0.52 


0.49 


Tryptophan 


0.17 


0.16 


Valine 


0.68 


0.65 



Minerals and vitamins meet or exceed levels recommended by NRC (1998). 
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Table 6. Vitamin premix composition^ 



PCT/CAOO/00430 



Nutrient 



Amount per 5 kg of premix 



Wheat midds 
Vitamin A 
Vitamin D 
Vitamin E 
Menadione 
Pantothenic acid 
Riboflavin 
Folic acid 
Niacin 
Thiamin 
Pyridoxine 

Vitamin Bj2 
Biotin 

Choline 



3.867 kg 

10 million lU 

1 million lU 
40 thousand lU 

2.5 g 

15 g 

5g 
2g 
25 g 
1-5 g 

1.5 g 
25 mg 
200 mg 

500 g 



7 



From Hoffrnan-LaRoche Limited, P.O. Box 877, Cambridge, ON. N1R5X9 



Table 7. Composition of the mineral premix 



Mineral component 


Amount (%) 




Limestone 


43.3 




Copper sulfate (25%) 


6.0 




Ferrous sulfate (30%) 


33.4 




Zinc oxide (72%) 


13.9 




Manganous oxide (56%) 


3.4 





Mineral premix prepared at Arkell 
^Dicalcium phosphate contained 18.5% calcium and 20.5% of 
phosphate and nomially is added at a level of 1.2% to the pig 
grower diet, 1,0% to the finisher diet and 1.5% to the nursing sow 
diet. 
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Table 8. Statistics on embryo recovery and the introduction of embryos 
containing the transgene into recipient sows. 



Treatment Number 

Gilts used for embryo recovery: 

Yorkshire 279 
Yorkshire x Landrace cross 168 
Duroc 12 
Total 459 

Recipient sows ^ 74 

Embryos transferred to recipients: 

Embryos microinj ected with the transgene 4 1 47 

Uninjected carrier embryos 675 
Total 4543 

Total number of embryo transfers 140 



Sows were used for up to three farro wings of potentially transgenic 
pigs. Sows were inseminated with Yorkshire semen from a high 
breeding value boars. 
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Table 9. Transgenic pigs containing a salivary phvtase gene generated by microinjections of 

single cell zygotes using the LamaZ-APPA transgene 



ID # of Birth Date Presence of Sex Salivary phytase Zygote source 
pig' Transgene^Tail/Blood (U/ml) 



167-02 


Apr 14/99 


+/+ 


Boar 


6,000 


Yorkshire 


282-02 


Jun 14/99 


+/+ 


Boar 


618 


Yorkshire 


282-04 


Jun 14/99 


+/+ 


Boar 


1,349 


Yorkshire 


405-02 


Aug 14/99 


+/+ 


Gilt 


339 


York/Landrace 


421-02 


Aug 24/99 


-/+ 


Gilt 


0.8 


York/Landrace 


421-04 


Aug24/99 


-/+ 


Gilt 


2.2 


York/Landrace 


421-06 


Aug 24/99 


+/+ 


Boar 


97 


York/Landrace 


448-01 


Sep 03/99 


+/+ 


Gilt 


0 


York/Landrace 


491-01 


Sep 25/99 


+/+ 


Gilt 


2.3 


York/Landrace 


491-02 


Sep 25/99 


+/+ 


Gilt 


0 


York/Landrace 


491-03 


Sep 25/99 


+/+ 


Gilt 


0.3 


York/Landrace 




oep Z^i^y 




rjoar 


U 


I orK/ i^anorace 




oCp J.Ofz^)7 




rsoar 


u 


1 orK/L^anarace 




ocp X'O/y^ 




rsoar 


i j>o 


I orK/L^ancirace 




oep Zo/yy 


-4-/4- 


r>oar 




1 orK/ 




yjfwr m /Qo 


-r /-r 


rsoar 




I orK/ l^anUiaCC 




iNOV UZ/!?j' 


"T /-r 


JDOar 




X oriwdiurc 




JNOV lo/>'5^ 


-4-/-(- 
-r/-t- 


VJllt 


X.J 


X orKsmre 


613-02 


Nov in 


-/+ 


Gilt 


0 5 


Yo rk/Lan drac e 


613-03 


Nov 27/99 


-/+ 


Gilt 


0.3 


York/Landrace 


647-01 


Dec 13/99 


-/+ 


Gilt 


0.5 


York/Landrace 


647-03 


Dec 13/99 




Gilt 


16.3 


York/Landrace 


647-04 


Dec 13/99 


-*/+ 


Gilt 


0.5 


York/Landrace 


647-08 


Dec 13/99 




Boar 


0.4 


York/Landrace 


647-09 


Dec 13/99 


+*/+ 


Boar 


1.92 


York/Landrace 


668-01 


Dec 17/99 


+*/+ 


Gilt 


489 


Yorkshire 


671-02 


Dec 19/99 




Boar 


6.9 


York/Landrace 


671-04 


Dec 19/99 




Boar 


325 


York/Landrace 


675-03 


Dec 21/99 




Gilt 


2.1 


York/Landrace 


675-04 


Dec 21/99 




Boar 


42.6 


York/Landrace 


675-06 


Dec 21/99 




Boar 


5.0 


York/Landrace 



The number preceeding the dash represents the litter number and the nimfiber following the 
dash is the pig number within the litter. 

^All PGR assays were conducted with the primer APP A-up2-APPA-Kpn. Assays indicated 
with a star gave a negative result with the primer pair. However these samples gave a positive 
result for the primer set APPA-d4'Lama*upl . Samples 613-02 and 613-03 were negative with 
the latter primer set. 

"^Saliva was sampled and assayed for phytase 2 to 4 days after birth of the piglets. 

"^Zygotes used for microinjection were collected from superovailated Yorkshire or Yorkshire- 

Landrace cross gilts. 
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Table 10. Phosphorus content of feces collected from pigs producing a salivary phytase and 
non-transgenic pen-mates^ The data was subjected to a T-test analysis and the data recorded 
below. 





Mean Fecal 
Phosphorus 

(%) 


SE 


Relative reduction 
m fecal 
pnospnorus (%) 


t 


t(l%) 


1. 167-02 Grower Diet (122 days): 


1.59 




2AA1 






Non-transgenic (n=4) 


2.11 


0.0604669 




8.517 


4.6 


2. 167-02 Finisher Diet (154 
days): 


1.97 




16.97 






Non-transgenic (n=4) 


2.37 


0.0240767 




16.717 


4.6 


3. 282^2 Grower Diet (93 days): 


1.85 




12.90 






Non-transgenic (n=5) 


2.124 


0.02223 1 964 




12.324 


4.03 


4. 282*02 Finisher Diet (145 
days): 


1-76 




16.03 






Non-transgenic (n=5) 


2.096 


0.099153384 




3.389 


4.03 


S. 282-04 Grower Diet (93 days): 


L95 




8.19 






Non-transgenic (n=5) 


2.124 


0.022231964 




1.821 


4.03 


6. 282-04 Finisher Diet (145 
days): 


1.56 




25.57 






Non-transgenic (n=5) 


2.096 


0.099153384 




5.406 


4.03 


7. 421-06 Starter H Diet (40 
days): 


1.17 




27.47 






Non-transgenic (n=5) 


1.612 


0.086155741 




5.140 


4.03 


8. 421-06 Start HI Diet (48 days): 


1.57 




18.01 






Non-transgenic (n=5) 


1.915 


0.102884789 




3.351 


4.Q3 


9. 421-06 Grower Diet (81 days): 


2.00 




13.28 






Non-transgenic (n=5) 


2.310 


0.151658823 




2.022 


4.03 


10. 421-06 Finisher Diet (136 

days): 


1.71 




21.20 






Non-transgenic (n=5) 


2.173 


0.053023237 




8.687 


4.03 


IL 405-02 Starter D Diet (40 
days): 


1.81 




26.97 






Non-transgenic (n=5) 


2.482 


0.173625623 




3.856 


4.03 


12. 405-02 Starter ni Diet (48 

days): 


1.54 




36.58 






Non transgenic (n^) 


2.430 


0.104642248 




8.496 


4.6 


13. 405-02 Grower Diet (80 days): 


2.26 




18.19 






Non-transgenic (n=4) 


2.763 


0.124724697 




4.029 


4.6 


14. 405-02 Finisher Diet (136 

days): 


2.26 




13.24 






Non-transgenic (n=4) 


2.605 


0.217198066 




1.588 


4.6 


'Fresh fecal samples were collected on 3 different days was freeze-dried and then dried to constant 
weight at 1 lO^'C for 24 h, and analyzed for total phosphorus. 
^At the 5% level of confidence t=2.57. 
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Table 1 1 . Phytase activities of the first generation (G\^ transgenic offspring obtained by the 
crossing the phvtase positive boar 167-02 with non-trans genie Yorkshire gilts 



ED # of pig 


Birth Date 


Sex 


Salivary phytase (U/ml) 


Specific Activity 
U/mg protein 


151-01 


Mar 16/00 


F 


1193 


126 


151-02 




F 


736 


63.3 


151-05 




M 


710 


109 


151-07 


« 


M 


8019 


315 


152-04 




M 


10077 


364 


152-09 


«« 


M 


3054 


200 


154-01 


Mar 19/00 


F 


2472 


256 


154-03 




F 


6425 


706 


1 54-04 


c« 


F 


n.d. 


n.d. 


154-05 


«« 


M 


2767 


213 


154-06 


cc 


M 


341 


39 


154-07 


C( 


M 


4029 


142 


154-08 


cc 


M 


1184 


47.4 


159-03 


Mar 20/00 


F 


1563 


116 


159-04 




M 


2285 


1 201 


'The number of males and females (M/F) in each litter were 5/3, 7/2, 5/4, and 2/3 for litter 
numbers 151, 152, 154 and 159, respectively. Saliva was collected fix»m the piglets on day 
11. 
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Table 12. Primers used for construction and detection of transgenic constructs. 



Name 



Start-End' 



Forward/ 
Reverse 



Primers used in RlS/APPA+intron and R15/APPA construction 



APPA- 
D0WN2 



R 



TCGGCGCTCACCTTGAGTTC 



APPA- 
DRA 



CCGTTTAAAGCCATCTTAATCCCAT 



APPA- 
SMA 



R 



G TCCCGGGT ATGCGTGCTTCATTC 



CAT-ATG 



R 



CCATGG TGGCGGClll'lAGCTTCCTTAGCT 
CCTGA 



CAT-TAA 



AGCQCITGCAGTTTGTAAGGCAGTTATTG 
GTGCCC 



CAT-UP 1 



TCG AGG AGC TTG GGG AGA TT 



R15-UP1 



nrCGGGCCAATGTTGCTGT 



Primers used in SV40/APPA-Hntron construction 



SV-HIND 



CCCAAGCTTTACACnTATGC 



SV-XHO 



R 



GCCCTCGAGCCTCCTCACTACTTCT 



_ 



Primers used in Lama2/APPA and Lama2/PSP/APPA construction 



APPA- 

CLA 



12635-12657 



GGATCGATAAAAGCCGCCACCATGAA 



APPA- 
D0WN2 



13307-13326 



R 



TCGGCGCTCACCTTGAGTTC 



APPA- 
DOWN4 



12751-12780 



R 



GCACGCACACCATGACGACTGACAATCAC 
C 



APPA- 
KPN 



13935-13959 



R 



CGGGTACCTTACAAACTGCAAGCGG 



APPA- 
MATURE 



12719-12738 



CAGAGTGAGCCGGAGCTGAA 



APPA- 
UP2 



13210-13229 



CGAACTGGAACGGGTGCTTA 



LAMA- 
CLA 



12615-12639 



R 



GCATCGATCTTTGGTTCTGACAAATGG 



LAMA- 
SIGNAL 



R 



TGACTCTGAGTTCCCAATGA 



LAMA-UP 



12111-12130 



GTGCTGCTCCAAGTTTGGTG 



Primers for detection of the porcine P-globin 



PIG-BGF 



gene 



GCAGATTCCCAAACCTTCGCAGAG 



PIG-BGR 



R 



TCTGCCCAAGTCCTAAATGTGCGT 



1 The location of the primers shown for Lania2/APPA sequence. 

The start and stop codons oiAPPA are indicated in bold letters, the optimal initiation sequence for 
translation is italicized, and the restriction sites for restriction enz>'mes are underlined. 
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THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE 
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 

1 . A transgenic non-human animal that carries in the genome of its somatic and/or genu 
cells a nucleic acid sequence including a heterologous transgene construct, said construct 
including a trangene encoding a protein, said transgene being operably linked to a first 
regulatory sequence for salivary gland specific expression of said protein. 

2. The animal of claim 1 wherein said first regulatory sequence comprises a saliva 
protein promoter/enhancer sequence, whereby said animal expresses said protein in its saliva. 

3. The animal of claim 1 wherein said animal is a mammal. 

4. The animal of claim 3 wherein said animal is chosen fi"om the group comprising pigs, 
goats, sheep, cows, horses, rabbits, rodents, cats and dogs, and in addition, fish and poultry. 

5. The animal of claim 1 wherein said saliva protein promoter/enhancer sequence 
comprises a parotid secretory protein (PSP) promoter/enhancer, a proline-rich protein (PRP) 
promoter/enhancer or a salivary amylase promoter/enhancer. 

6. The animal of claim 5 wherein said promoter/enhancer is a parotid secretory protein 
(PSP) promoter/enhancer. 

7. The animal of claim 6 wherein said parotid secretory protein (PSP) 
promoter/enhancer is derived fi'om a mouse. 

8. The animal of claim 5 wherein said promoter/enhancer is a proline-rich protein (PRP) 
promoter/enhancer. 

9. The animal of claim 8 wherein said proline-rich protein (PRP) promoter/aihancer is 
derived fi*om a rat. 
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10. The animal of claim 1 wherein said transgene is further operably linked to one or 
more second regulatory sequences including enhancers, transcription regulatory sequences, 
teimination sequences, and polyadenylation sites. 

1 1 . The animal of claim 1 wherein said transgene comprises a gene encoding a protein 
haviQg ph3^ase activity. 

12. The animal of claim 1 wherein said transgene encodes a phytase or a homologue 
thereof. 

13. The animal of claim 1 wherein said animal is a pig, said transgene comprising a gene 
encoding a protein having phytase activity and wherein said first regulatory sequence 
comprises a parotid secretory protein (PSP) promoter/enhancer or a proline-rich protein 
(PRP) promoter/enhancer. 

14. The animal of claim 1 wherein said transgene construct comprises a nucleic acid 
sequence according to SEQ ID NO:3, SEQ ID NO:5; or SEQ ID NO:7. 

15. A transgenic non-human animal that carries in the genome of its somatic and/or germ 
cells a nucleic acid sequence including a heterologous transgene construct, said constmct 
including a trangene encoding phytase or a homologue thereof. 

16. The animal of claim 1 5 wherein said transgene is operably linked to a first regulatory 
sequence for salivary gland specific expression of said phytsise. 

17. The animal of claim 16 wherein said first regulatory sequence comprises a parotid 
secretory protein (PSP) promoter/enhancer, a proline-rich protein (PRP) promoter/enhancer 
or a salivary amylase promoter/enhancer. 

18. The animal of claim 17 wherein said animal is a mammal. 
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19. The animal of claim 18 wherein said phytase or a homologue thereof is expressed in 
saliva or in the gastrointestinal tract of said animal. 

20. The animal of claim 15 wherein said transgene construct comprises a nucleic acid 
5 sequence according to SEQ ED NO:3, SEQ ID N0:5; or SEQ ID N0:7. 

21. A method of expressing a protein, the method comprising the steps of: 

a) introducing a transgene construct into a non-human animal embryo such that a non- 
human transgenic animal tiiat develops from said embryo has a genome that comprises said 

10 transgene construct, wherein said transgene constmct comprises: 

i) a transgene encoding said protein, and 

ii) at least one regulatory sequence for gastrointestinal tract specific expression 
of said protein, 

b) transferring said embryo to a foster female; and, 

15 c) developing said embryo into said transgenic animal 

wherein said transgene is produced in the gastrointestinal tract of said animal. 

22. The method of claim 21 wherein said regulatory sequence provides for salivary gland 
or pancreatic gland specific expression of said protein. 

20 

23. The method of claim 21 wherein said regulatory sequence provides for salivary gland 
specific expression of said protein. 

24. The method of claim 23 wherein said salivary gland is a parotid gland, submaxillary 
25 gland, or a submandibular gland. 

25. The method of claim 23 wherein said transgene is expressed in the saliva of said 
animal. 

30 26. The method of claim 21 wherein said transgene is heterologous. 
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21. The method of claim 21 wherein said at least one regulatory sequence comprises a 
salivary protein promoter/enhancer sequence. 

28. The method of claim 2 1 wherein said protein is a glycoprotein. 

5 

29. A transgenic animal adapted for expressing a protein according to the method of 
claim 21, or a progeny thereof 

30. The method of claim 21 wherein said protein is a phytase or a homologue thereoE 

10 

3L The method of claim 21 wherein said transgene construct comprises a nucleic acid 
sequence according to SEQ ID NO:3, SEQ ID NO:5, or SEQ ID NO:7. 

32. A process for producing a protein comprising the steps of: 
a) obtaining saliva containing said protein firom a non-human transgenic animal, said 

animal containing within its genome a transgene construct, wherein said transgene construct 
comprises: 

i) a transgene encoding said protein, and 

ii) at least one regulatory sequence for salivary gland specific expression of 
said protein, and 

extracting said protein from said saliva. 

33. The process of claim 32 wherein said transgene is heterologous. 

25 34. The process of claim 32 wherein said at least one regulatory sequence comprises a 
salivary protein promoter/enhancer sequence. 

35. The process of claim 32 wherein said protein is a glycoprotein. 

30 36. The process of claim 32 wherein said transgene constmct comprises a nucleic acid 
sequence according to SEQ ID NO:3, SEQ ID NO:5; or SEQ ID NO:7. 
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37. The process of claim 32 wherein said protein is a phytase or a homologue thereof. 

38. The process of claim 32 wherein said salivary gland is a parotid gland, submaxillary, 
or a submandibiilar gland. 

5 

39. A method for expressing a phytase or a homologue thereof in a non-human animal, 
said method comprising: 

a) constmcting a micleic acid sequence including a transgene construct comprising: 

i) a transgene encoding said phytase or a homologue thereof, and 
10 ii) at least one regulatory sequence for gastrointestinal tract specific expression 

of said protein, and 

b) transfecting the animal with said nucleic acid sequence; 

whereby said animal carries within the genome of its somatic and/or germ cells said 
transgene construct and wherein said animal expresses said phytase or a homologue thereof 
IS in its gastrointestinal tract 

40. The method of claim 39 wherein said transgene constmct results in salivary gland or 
pancreatic gland specific expression of said phytase or a homologue thereof. 

20 41. The method of claim 40 wherein said regulatory sequence provides for salivary gland 
specific expression of said phytase or a homologue thereof. 

42. The method of claim 41 wherein said salivary gland is a parotid gland, submaxillary, 
or a submandibular gland. 

25 

43. The method of claim 41 wherein said phytase or a homologue thereof is expressed in 
the saliva of said marzmial. 



44. The method of claim 41 wherein said transgene construct comprises a nucleic acid 
30 sequence according to SEQ ID NO:3, SEQ ID N0:5; or SEQ ID N0:7. 
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45- The method of claim 39 wherein said nucleic acid sequence is introduced into said 
animal in the form of a transgene construct. 

46. The method of claim 45 wherein said transgene constract is a nucleic acid molecule. 

5 

47. The method of claim 46 wherein said plasmid comprises a nucleic acid sequence 
according to SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, or SEQ ID NO:6. 

48- The method of claim 39 wherein said animal is chosen from the group comprising 
10 pigs, goats, sheep, cows, horses, rabhits, rodents, cats, dogs, fish and poultry. 

49. The method of claim 48 wherein said animal comprises a mouse or a pig. 

50. A nucleic acid molecule comprising a nucleic acid sequence including a gene 

15 ^coding a protein, said gene being operably linked to at least one regulatory sequence for 
gastrointestinal tract specific expression of said protein. 

51. The molecule of claim 50 wherein said at least one regulatory sequence comprises a 
salivary protein promoter/enhancer sequence, whereby expression of said protein is salivary 

20 gland specific. 

52. The molecule of claim 5 1 wherein said salivary protein promoter/enhancer sequence 
comprises a parotid secretory protein (PSP) promoter/enhancer, a proline-rich protein (PRP) 
promotar/enhancer, a salivary amylase promoter/rahancer, or a SV40 promoter/enhancer. 

25 

53. The molecule of claim 5 1 wherein said protein comprises a phytase or a homologue 
thereof. 

54. The molecule of claim 53 wherein said molecule is a transgene construct. 

30 

55. The molecule of claim 54 wherein said molecule is a nucleic acid molecule. 
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56. The molecule of claim 55 wherein said molecule comprises a nucleic acid seqtteiice 
according to SEQ ID NO:l, SEQ ID NO:2, SEQ ID N0:4, or SEQ ID NO:6. 

57. The molecule of claim 53 wherein said molecule includes a nucleic acid sequence 
5 according to SEQ ID NO:3, SEQ ID NO:5; or SEQ ID N0:7. 

58. An antibody specific to a protein expressed by a nucleic acid sequence according to 
SEQ ID NO:3, SEQ ID NO:5; or SEQ ID NO;7. 

1 0 59. The antibody of claim 58 wherein said antibody is monoclonal, 

60. The antibody of claim 58 wherein said antibody is polyclonal. 

61. A hybridoma secreting the antibody of claim 59, 

15 

62. A host cell transfected with molecule of claim 50. 

63. A host cell transfected with the molecule of claim 56. 
20 64. A host cell transfected with the molecule of claim 57. 

65. The host cell of claim 63 wherein said cell is an bacterial cell. 

66. The host cell of claim 64 wherein said cell is an animal cell. 

25 

67. A diagnostic kit for immunologically detecting a protein expressed by a nucleic acid 
sequence according to SEQ ID NO:3, SEQ ID N0:5; or SEQ ID NO:7, the kit including an 
antibody specific to said protein. 

30 68. The kit of claim 67 wherein said antibody is monoclonal. 

69. The kit of claim 68 wherein said antibody is polyclonal. 
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Figure 5. The nucleic acid sequence of the Lama2/APPA plasmid (SEP ID NO: l^ 



LOCUS 

DEFINITION 
ACCESSION 
KEYWORDS 



REFERENCE 

AUTHORS 

JOURNAL 

FEATURES 

DEFINITION 
ACCESSION 
VERSION 
SOURCE 
ORGANISM 



Iiama-appA 20623 bp DNA CIRCUL»AR SYN 17-aAN-2000 
Lama 2/APPA transgenic construct 
Lama 2 - appA , 

parotid secretory protein; acid glucose -1 -phosphatase; appA 
gene ; 

periplasmic phosphoanhydride phosphohydrolase ; artificial 
sequence; 
cloning vector 
1 (bases 1 to 20623) 

Golovan, S., Forsberg, C.W. , Phillips, J. 
Unpub lis hed . 

M. musculus Psp gene for parotid secretory protein. 
X68699 

X68699.1 GI:S3B09 
house mouse. 
Mus musculus 



Eukaryota; Metazoa; Chordata; Craniata; Vertebrata ; Mammalia; 
Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 3777 to 5332;) 

AUTHORS Svendsen , P . , Laursen , «J . , Krogh - Pedersen , H . and H j ort h , J . P , 
TITLE Novel salivary gland specific binding elements located in the PSP 
proximal enhancer core 



JOUl?NAL 

MEDLINE 

REFERENCE 

AUTHORS 

TITLE 

JOURNAL 



REFERENCE 
AUTHORS 



Nucleic Acids Res. 26 (11) , 2761-2770 (1998) 
98256451 

2 (bases 7147 to 12653; 13952 to 17731) 

Mikkelsen, T.R. 
Direct Submission 

Submitted (07-OCT-1992) T.R. Mikkelsen, Department of Molecular 
Biology, University of Aarhus, CF Mollers Alle 130, 8000 
Aarhus , DENMARK 
3 (bases 7147 to 12653; 13952 to 17731) 

Laursen J, Hjorth JP 



TITLE A cassette for high-level expression in the mouse salivary glands 



JOURNAL 
MEDLINE 



Gene 1997 
9370303 



Oct 1;19B (1-2) :367-72 



FEATURES 



misc feature 



enhancer 



exon 



exon 



misc feature 



Location/Qualifiers 
source l.to 12653; 13952 to 17731 

/organism* "Mus musculus** 
/strains " C3H/As " 
/ db_xr e f = " t axon : 1 0 0 9 0 " 
/ chromosomes " 2 " 

/maps "Estimate: 69 cM from centromere" 

/clone=" Lambda YPl, Lambda YP3 , Lambda YP7" 

/clone_lib="Larobda-PHAGE (Lambda L47.1)" 

/germline 

/note= "Allele: b" 

3777-5332 
/gene=*'PSP" 

/function^" salivary gland specific positive acting 
regulatory region" 
7147. .8724 

/ evidence ^experimental 
11778 . . 11824 
/gene= "Psp" 
/note="exon a" 
/ number =1 

/ evidence=experiraental 
12626.. 14190 
/gene- "Psp" 

/note- "exon b fused with 
12644-12652 



exons h and i 
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Figure 5 (continued): 

/function^" consensus sequence for initiation in higher 
eukaryotes * 
niisc_f eature 13952-13965 

/function*" M13mplB polyl inker" 

DEFINITION E. coli periplasmic phosphoanhydride phosphohydrolase (appA) gene. 



ACCESSION 
VERSION 
SOURCE 
ORGANISM 



M58708 L03370 L03371 3!i03372 li03373 1,03374 L03375 
M58708.1 61:145283 
Escherichia coli DNA. 
Escherichia coli 



Bacteria; Proteobacteria ; gamxna subdivision; Enterobacteriaceae ; 
Escherichia . 



REFERENCE 



1 (bases 12653 13951) 



AUTHORS 
TITLE 



JOXJRNAL 
KBOLINE 



Dassa,J., Marck,C. and Boquet,P.L. 

The complete nucleotide sequence of the Escherichia coli gene appA 
reveals significant homology between pH 2.5 acid phosphatase 
and glucose- 1 -phosphatase 

J. Bacterid. 172 (9), 5497-S500 (1990) 

90368616 



FEATURES 

Source 



si9__peptide 
/gene= " appA " 
CDS12653 



Location/Qualifiers 

12653 . .13951 
/organisms "Escherichia coli" 
/ db_xref = " taxon -.562" 
12653. •12718 

13951 

/gene=**appA" 

/standard_name= "acid phosphatase/phytase" 
/ 1 rans l_t abl e= 1 1 

/product*" periplasmic phosphoanhydride phosphohydrolase" 
/protein__id= "AAA72086 . 1 " 
/db xref=5"GI = 145285" 



/translation«"MKAIIiIPFLSLLIPLTPQSAFAQSEPELKLESVVIVSRHGVRAP 

TKATOIiMODVTPDAWPTMPVKLGWLTPRGGELIAyLGHYQRQRLVADGLLAKK^ 

GQVAI lADVDERTRKTGEAFAAGIAPDCAITVHTQADTSSPDPIiFNPLKTGVCQLDNA 

NVTDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSNIjCLKREKQDESCSLTQALPS 

ELKVSADljJVSLTGAVSLASMLTEIFIiLQQAQGMPEPGWGRITDSHQWNTtiLSLHNAQF 

YLLQRTPEVARSRATPLLDIiIKTAIiTPHPPQKQAYGVTLPTSVLFIAGHDTNIANLGG 

ALEmWTLPG0PDNTPPGGELVFERWRilL.SDNSQWIQVSI/VFQTIjOQMRDKTPL.SI^T 

PPGEViCLTLAGCEEPJJAQGMCSliAGFTOIVNEARIPACSL" 



mat^pepcide 



mutation 



mutation 



mutation 
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12719 13948 
/ g ene = " appA " 

/products "periplasmic phosphoanhydride phosphohydrolase" 

replace (12659 . . 12661, "gcg changed to gcc") 
/gene="appA" 

/ s t andard_name = "A3 mutiant " 

/note=" created by site directed mutagenesis" 
/citation^ [3] 

/phenotype= "silent mutation" 

replace (13 934 . .13936 , *• ccg changed to ccc") 

/gene= " appA " 

/standard_name=" P42 8 mutant" 

/note= "created by site directed mutagenesis" 

/ citation= [3] 

/phenotype= silent mutation " 

replace (13 93 7. .13939, gcg changed to get") 

/genes "appA" 

/standard_namea=" A429 mutant" 

/notes "created by site directed mutagenesis" 
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Figure 5 (continuedl: 



/citation^ [3] 

/phenotype= " silent mutation 



DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
aOXJKNAXi 

REFERENCE 

AUTHORS 

TITLE 

JOURNAL 

MEDLINE 

REFERENCE 

AUTHORS 

TITLE 

JOURNAL 

MEDLINE 

FEATURES 

Source 



CDS 



pBluescript II KS(+) vector DNA, 
X52327 

X52327.1 GI:58061 

artificial sequence; cloning vector; expression vector;, vector, 
synthetic construct - 
synthetic construct 
artificial sequence. 

1 (bases 17732 to 20623) 
Thomases. A. 

Direct Siibmission 

Submitted (20-FEB-1990) Thomas E.A. , Stratagene Cloning 
Systems, 11099 North Tomey Pines R<i. , La Jolla^ CA 920377— USA 

2 (bases 17732 to 20623) 

Short r J. M., Fernandez , J . M . , Sorge^J.A. andHuse,W.D. 

Lambda ZAP: a bacteriophage lambda expression vector with in 

vivo excision properties 

Nucleic Acids Res. 16 (15), 7583-7600 (1988) 
88319944 

3 (bases 17732 to 20623) 
Alting-Mees,M. A. and Short ^ J. M. 
pBluescript II : gene mapping vectors 
Nucleic Acids Res, 17 (22), 9494 (1989) 
90067967 

Locat ion/Qual if iers 
17732 to 20623 

/organisms "synthetic construct" 
/ db_xr e f = " t axon : 3 2 6 3 0 
complement (18967 19827} 
/genes "Amp" 

/products "b- lactamase" 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1S21 
1681 



5449 a 4847 C 4902 g 5424 t 



TCGAGAGTAT CTTTGTCAGC 
ATCTAAACTA ATTAATTAAT 
TGTTGAACAA GTTCTCCAAA 
CTGAGGAGAC ACCTGCATCT 
AGGGTGGTTC TGTGGGACAG 
AAGCTACCCC AAACGACAGA 
GCCGGACAGT GAGACAGACA 
AGGGATTGAG AGACCCTGAC 
ACAAAGCT6C CAAAGACCAA 
ACAGCATAAT AAGCAGAGTG 
ATAAAAGGAC AGTATTACAG 
TTTAAGTAGG GTAAAGTACT 
GTCTCTTACT GTTTAAATGA 
GGACAATATA TATTTAGAGA 
CACCAAGACT GCAGCACACC 
GTGGTGGTGA AGATGTACTA 
CACACTGGAG CAACCACTGT 
GCGGGGCGTG GTGGCATACA 
TCTGAGTTCC AGGCCAGCCT 
AAAAACCCTG CCTTGATTAA 
ACCAAACCAA ACCAAACCAG 
TCCTAGATAT ATACCCAATG 



ACTACACTGT 
GGATAGGTAA 
TCATTTTTCT 



TCACCACAGC 
CTTTCAAGGT 
TTATGAGGTG 



TGAAGATACT ACACTGGTCC 
OTATCCTTAC CATCATTTGT 



A.TGTAATATC 
TTCTTCTTTT 



AGTGTGAGGA 
GAAAACTGTC 



TGTGCCTCCA ACAAAGGGGT 
CCCTCACCCG CAAATCTTTC 
GGAGAGATAC AGATGAGTGC 
GACTAAGAAG AGCCACGGTG 
TAGAAAATCG AGAGGCATGT 
GATTGTCAGT CAGGCCAATC 
CACCTACTCA GTTGGAGGAA 
AGGCGCAAGG CCCTAACACA 
AGACTTGTTC TCCATTAGAA 
TACTCTGATT GGAGAACTTT 
ATTTTGTTGT ACACTGCTGT 
CTTTAAAAAT GGGTCCTAGA 
TTTTTATTTT GTTTAATATG 
AAGATGGTTA GCTGTCAGAA 
CCTGTCAGAT GGCTGTGATC 
AAGGGAAACA CACACACACA 
GGAAATCAGT ATGAATGGTC 
CTTTTATTCC CAGCACTGGG 
GGTCTATAGC ACAGGTTCTA 
ACCAAACCAA ACCAAACCAA 
ACCAAACCAA AACACTGAAG 
GAGACTAAGT CAGCAAGACA 
CAGGCTGTGG AACCAGCCTG 
AAATGGACTC TGCTGTGTAC 
TCCATTCAGG AGTCACATGG 
CCACAGTTTA CACTTTTATC 
TGTAATTTTT CTTGATGACC 
AGTACAACTT GTTTTCTAAG 
GGTTCCTGAC ATCTGCTCAG 
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ACTGTTGCCC 
AGTCACTAAG 
GTATAGGGTG 
TTAGTTGAAT 
GCCGTTTAGT 
CGTTTCGAGT 
GGATGAGAAC 
CACACCTACC 
ATGACAGCTG 
AATGTGTTTC 
TACATGTGGG 
TATTTTTTCC 
GAGGAAAAAG 
AAATATGCAA 
AAGAAAATAA 
CACACACACA 
CTCAAAAACC 
GAGGCAGAGG 
GGACAGCCAG 
ACCAAACCAA 
ATAGAACTTC 
CCTGCACAGC 
AGTGTCCATG 
ATGCCTCACA 
TAGTTCTATT 
AGCAGTGAAT 
CTCTTTCTGA 
TATTTATTGG 
GTATTCATTG 



ACATAGAAAG 
TTAGCACGAT 
GACCTGGCTG 
GGTGTGGAGT 
GAACTGATGG 
TTGATGGGCA 
AATGGCCAGC 
ACCTCACTTG 
GCTTGACCCG 
ATTCAGTATT 
GCAGTGTGTC 
TTTAACTCAA 
AAGCGTAAAT 
ATCAAAATCA 
ATGACAATGA 
CACACACACA 
TGAAGATAGA 
CAGGTGGATC 
GGCTACACAG 
ACCAAACCAA 
AGTATTCCAT 
CATGTTCACT 
ATAAATGAAT 
TTCTGTTTAT 
TTCAGTCTTC 
AAGGGTTCCT 
CAGGGATAGG 
CCCCTTGCAT 
GATGTTGTTT 
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Figure S (continued): 

1741 CTTTGGTGTT TGAGTTCTTA TGAATTCTAG ATGTTAAATC CCTGCCTGTG GTTCTCTCCC 
1801 ATTCTGTAGG CTGCCTCCTC ACCCTGGCAA TTGTTGTCCT TGTTTTGCAG AAACTTTTGA 
1861 CTTCATGGAA TCTCATTTGT CAGTTTTCCC TCCTCTGCTA TAGCCTGAGC TAATGCACTG 
1921 GTTTTTACAG AGCCCTGGTC TATGCCTTTA TCCTCCTCTG GCAGCTTCGG AGTTTCATTT 
1981 CTTACATTTA GATCTTTGAT CCACTTTGAA CAAGTTTTGG AGCAGGGTGA GAGATACGAA 
2041 TCTAGTTCCA TTCTTCCATA TGTGATCCTA GTTTACATAS CATCGTTGGT TGAAGAGGTT 
2101 TTATTTTATT TTTAAATAAT GTGTCATAAA AAACGAGGTG GTTGTAGCAG TGTGGATTTG. 
2X61 TTTCTTTGTC CTTTGATCTA CAGGTCTTGT TTTGTGTCAG TCTCATGATG TTTTATTGCT* 
2221 ATGGCTCTGT CATACAGTCT GAGGTCAGGT ATTGTGATAT ACCTTCAGTA TTGCTCCCTC 
2281 AGACTCAGGT TTGCTTTGGC CAGGAGTCAT CTTACTCAGT GCTCTTAGAG CTCCCCCAGC 
2341 ATGTAGCTGC TACTATTCTT AGTTGATAA?V TCAGGAAACT GGGGCTCAGA GAGATTAACT 
2401 GTCTTGAACT ACTTCTGGGG AGGTGAAACG TGGAGACACT AAACTGTGTT TACCCTGTAC 
2461 TGCTCCAGTA GCTGTCGGGT GCTGGGCTAC AGCAAAGCAC CTATACTATA TATTACTCAG 
2521 GAGGTGGAAA AACTCAGCCT CCCTTGGGGT TCCCAAGCTC CCAGGTGTCC AGTCACTGCT 
2581 GGAAACCTCA TGGAGTCTGA AAGGAAGGGT TGAGGGTACA TGGGGCAGCG ATGAGGAGCC 
2641 TGGGGCTGGG ATCTCCCAAA CACCTGGATA TCCAGATGCC ACTGGGTCAG GGGGAGTTGG 
2701 GAACAGAGTT GGGATGTCCA TGGACCTGTG ACAAGGCCAG GGCCAGGGGG AGGATAACTC 
2761 TGGCTTTACT AATTTGCGAA AGTCCTTAGC TTAGCAGCAG TTGTCTGGGA GCACAGAGGG 
2821 GCCTTCTGTA AGAGGCTCAG GCAGTGCCGC TCTGTAGGCG AAGGTCTTCT CCATGTTCCC 
2881 CATGGTGGTT CTTGATGAAA GAGACAGTCC TTGGCTCCAA ACTGGTTTAT TGATTGTTCA 
2941 TTGTGGAAAA TGGGTGCACA CCACCTTCTC AGGGTGGACC AGAGATCAAA TACCTTTTGC 
3001 AGGGAGGAAT ATCTGGGATi^ GGACGCTTAC TGGCTAAACC CTCAGGGCCT CTAGATACAT 
3061 CATTAGCATG GAGAACTCTG TTCTGGGCTA CATGACCACA GGCCACATTT CCACAAGCCA 
3121 CATGTGGGAA GTGTGGCACA TGTTCTAGGC CAGGAATCTG GTAGGGAGCG TGGAGCCACC 
3181 TACCATCCCA GGTGGGTGCC TGGGTGCCAG GGACCCTGAA CCCGCTCAAC CTTACCAAGT 
3241 TTCCTGGCAG GGTCCACTGT CCTACACAGA AGCTGGAGGA GGTGTGAGGG TTGTGTCTTT 
3301 GTGGAATGTC CCATGCTGCT TGGGGCTCAG TTTCTCCACC TGTACCTCAT TGGTTTGGGT 
3361 ATAAZUVAGTG GGGATACTTT ATTATTCTCT GACTCGGTCC TGAGGAAAAA GCATCGTGGC 
3421 AGTCCAGGAA CCACACCCTG AGGTTCCTGC ACTGAAGGGA CTCCCTAAGT CTCTGGAGTC 
3481 TCTCCCCTTC ACAGAGCTGC CAAAGTCTAG GTTCTTTTGA GGATAACAGA GCCATGCTTG 
3541 GTAAGCAGAC AACAGCATTT GTTTACTCAA CCTTCTTTTG TCAGCTCCCT CTTCATAAAC 

3 601 AAGTTGAGAC ACCATGCTGG CTTGAGGAAG ACTTCTAAAG CCAGACAACT GTGCAAGGAA 
3661 GAAGAAGAAG GGGCAAGTGG AGTTAGCCTG GATGTAGCCC TCAAAGTCTC CAGAGACCAG 
3721 CCATGAAGGC TCAAGTGGAG GGCAAGACCT GCAGCAGCCA AGCATCTGGC AGGAGAGGAT 
3781 CCTGGGAACC CCTCTACCAT GACACACATT CTTCCTGCAG GTCACACTTA ATAGGCCATT 
3841 TCTTATTTGG ATCTATCATG GTGTTCTGTG CGAGATTAAT GAGGTGTTAT GCTGCGAACA 
3901 GAAAGTTATA TAAAAACAAG TCCCCCCCCC TTGTCACTGC TGCTAAGAAT GTAGCAGAAA 
3961 TTGTCTCAAG TGTCTCTCTA ATCAGAAACA ATAAAGGTCT CCTTGGATTC AAGCCCTCCA 
4021 GTTTCCTCCT TCCTTGCTGA GCCTTGGACA CCCATACAAA CCTCCTGGAT GCTACAGCTC 
4081 TGGGCAGAGA CTCCAAGGTG GGGAGAGACT GATGGTACAA AAGCAAAATA CTTGTTTGGG 
4141 GGTACACCCA CTCCTCTGCC TGTGTGGTTC CTGCAGTCAG TCCTGCAGAC AGGCCCTCAG 
4201 TGGGTCTTCC ATGGGCAACA CGCAGAGGGA GGCAATGGAT GGGAATACCC ACACCCTGGT 
4261 TAGTTTACCC CGGCCATGCT CTCTGCTCTT CATCCCTCCT CTGCCCTCTG CCACGGCTTT 
4321 CTCTGCAGGA ATCATATCTT CATATTGGCC CACAGGTGTT CTCCTCACCC TAGCTATGAT 
4381 GTTTACTTTA GAGTGACCTT AGCAGGGCTG GTGGGAATGA GTTCTAGAAG GCTCACGGAG 
4441 ATGCTAGGGA AGAAACGTCT TCTAACTACT GAGGTTACTA AGTTCCTGGT GGTTGTCTCT 
4501 GCCTTTCCCT TGTTAAAGTC ACCTTGAAGT TAGTGCAGAA GAAATCAGAG CCCAGTCACA 
4561 GAGTAAATAT GGTCCTGAAG ATTTCCTTTG AGTGCCCAGA ATCCATGACA TTTCAAGAGC 
4621 CCTCTTTGTA CCTTAAGTCA TTTGGGGTTG TATCTTCTGC TTGATGTATG TGTGTGTGTT 
4681 TATCAAAGAG TGAGATGGTT ACATAAGAGG TGCTCTAAAG GACAGAGAGG ATTTGCAATT 
4741 GT6GCATGTG ACATCCTCAG GCCTTGCTCT GGTGCCAGGA GQAACTGATG CAGAAAAGAG 
4801 TAAGAGGTCA TTTCCTGGAG GCTGTCACTA TAGAGGAGAT CTTACAGTGC ATTCCCTCCT 
4861 CCAGGCCCTG CCTGAGGATA GACATGTGCT GACTGCAACT GAAACAGAGG CTTGGGATGG 

4 321 AGAGTTAGGT TCACAGAAGG GAGGGTGGGA GATGGATGCT TGCTGGGTTC TGGGTCTCAT 
4 981 CACCAGCTCC TGACCACCCG GTCAGCCCAT GTGC1TATTC CATAGCTTTC TTTTGCTATG 
5041 TTTACTCAGT GTGGTGTTTG TTGGGACCCA GCAGAAGCCA GTCCCAGGCT GACAGCTGTG 
5101 GATACACAGG GCAGCATGAG GGTCCTCAGC CTGAAGCAGT CAGGCTGGCA GAAGAGAAAG 
5161 ACCAGCACAC ATTCCTTCAA CCAACTATGT CTTGAAAAAC AAACATATTA TATCACATAT 
5221 ATTGCATTTA TGAGACAGCT AAAATGTACT CGGGTAGCAT GACTCCAGGT GGGGATATCT 
5281 GCAAGTGCCA TGAGTGGCAG AGGGACAGCC AATGTGAGGC AAGAAGGAAT TCTGGCTCAA 
5341 CACAGCTTAG CTCCCTGGTG TTGGTTCAAA CTTTGAGAGT TTGACCACAA GCACTTTATT 
5401 TTTGACATAT TTAAACAGAG CACAACTTTG GGAAAAAGTT TTCTTATGAA AATTATCACA 
5461 ATAAAGCTTA AGGCATGACT ACATTAAAAT GCCTTTGCAA AGTATATGTG CCCTCTTCCA 
5521 CAAGAATGGT TCTATTGACT GAGAAATAAT GTTCAGGATA AAGATCCAGG AAGAAAAGAT 
55S1 CAGGGATAAG TAAAATACTA AACTCTTTTG CAAAGTACAT AGACCCTCTT TCA.TAACAAT 
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Figure 5 (continued V 

5641 GGGTTCTATT GACTGACAAG CACTGCTCAG GAGTTGGGAA AGAGTCTAGC ATAAGCACGA 
5701 TAGCCTGGAG ACTCTAGTGA GGTCTAGTCT TACAGACAGC AAAAATCACC AGGTTACAAA 
5761 CTACATTCAT TTCCAGTTTT CTGATCAGGC ACAGGTATGA ATCCCTTCTG TTGAAGAGAA 
5821 AAGTCCATGT GTTTAAAATA TCTGGTTTCT CCAGTGCTAT TAGCGAGAAG ACTTGAGCCC 
5881 TATACAACTC CCACCTGGAG TGACATCCTG TCTTCATGGT ATATTACATA CCTAGACACG 
5941 CTCATCTCAC AGACTTAGGA CTTTGTCTTC TGATCTCCAT TTCTGATCCC ACTTCCACCT 
6001 TTGCCTTGAT AGTGTCATTT TCTTCACTGC CTTGGTGACA ACCATGTTAT CCTCTGTGTA 
6061 TTTGAGTGTT ACCATTTTCA GATTTTACCT GTATGCAAGA TCACACAGTC TTTGTCTTTC 
6121 TGTCTGGATG CATGCTAATC TCTACACAAC AACCCTTCCC CGTCACTCAG ATCTTCCTCC 
6181 ATTAACACAT ACATGGTGCT GAAGAGGCTA GGGAGCTTCC CTTCAGTGGG GAGCTAGCTG 
6241 GCTATTGGGC CTTTTTGACT GTCCAGGAAG GCCCCCAATT GCTGAGACAA GAACTTAGAT 
6301 TCTTCATTAT TGACTCTAAC TCATGTATCA AGCAGAAGCT AATGAATAGT TATCAACAGG 
6361 ATCAGAGGTT CCAGTGTAAG ACACTTTGAC ATGAAAGAAC G6AGGAAGGA CAGATGGATG 
6421 CATAAAAGCA GGACCACTGC CCCASGAAGG TCCTGGAAAC TGATGCAGGG CA^GGACAG 
6481 GTTATAAACC AAATCTTASG GAGTCAGGAA GAGCACA6AG GAGCTCAACC AACTGACCAC 
6541 TGCTTAGGGG CTACCAACCC AATCCTCCCT GTGGGAACAG CTAAGCTATC AGCCAAGGGT 
6601 AATAAACAGG CAGGACCTGT GGATGACATG GAGAGCATAG GGACCCTGGG TCCAGCCTTT 
6661 AGCACCTGCA CTCTCAGGAT ACTCCACCAT TGTGTCTTAG AGAGCCTAGG GATACTGGGT 
6721 CCAGCCTTTG GTACCTTCAC TCTCAGGGTA CCCCATCACT GTGTCTTGGA GAGCCTAGGC 
6781 ACCCTGGGTC CAGCCTTCAG TACCTGCGCT CTCAGGACAC CCCACCATTG TCTCTTGCCC 
6841 CGTCTCTTCT TCCTCTTCCT CCCTTTCATT GTCTCTTCTC TGTTTCTTTC TTGACTCTCC 
6901 TTTCCCCTCA CACCCTCACT CTAGTTCTCC CCTTCCCTCT CTGCATCACC CTATTCTCTC 
6961 TGTGGTCCCT CCACTTTCCT TTATCTCTCA TGCTTCTCTC CTCCCTCAAA TACTTGTCAC 
7021 CCACTATACT TCAGGGGCCA GCTCTAGTGA CAAAGCTGTT AATAGCAAGA CTCTCAGATC 
7081 TCCAACGGCT CAGAGGAGCC AGACCCACCA AGAACTCTCT CCAGGTCCAA TTTCAGGTTC 
7141 CTTCGAAAGC TTTCAGCAAA TGCTCAGGGA ACATGCCACT AACAAGAAGA TGCAAATTCC 
7201 AGTTGAGAGT GGGAAAGGCC CTTGCGTAGG TCCCATCTTC CAGGCCAAGG TCAGAGGGGC 
7261 TCTGTGTAAT CCGGATTGAC AGGGCTCAGA ACAATGTTTT GTTTTTAAGG TTTATTTATT 
7321 TTAGGTGTTA GTGTCTTTGC TTGCATGACC TTATGTGCAT CATGTGTGTG CAGGTTCCTG 
7381 ATGACA6TA6 AGGAGGGCTT TGAATCCCTG GGGATAGGAA GTTACAGGAA ATTATAAGCT 
7441 GCTTTGTGGG TCTTCTAGCT TTCCCAACAG AAGTGAATGC TCTTCACCAC TGAGCCATCT 
7501 CTCTAGGCCC AAGAGACATT GCTTTATGGA TATAATTGTG TGTGTGTGTC AACATTGAGG 
7561 AAAGGGAAAT AAAAAAAAAA CTTCAGCCGC TAAGGTTGTA CAGTTTCACT AATTGCTACT 
7621 TTTAGTTGTG ATAAAATGGC AG6TGCTTCA ACATTTATAT ATACAAAAAC TTCCCTGCTG 
7681 GTGGTTCAAC TGTGAGAACT GGGGTAAGTG GGTGAGTTCT CTTTTTCTGT CTCTGTCTCT 
7741 GTCTCTCTCC TTCCATTCTT TCTTAAAGGA AATAAACATT GCAGCTGGGT TATAGCTCAT 
7801 CAATATGGAA GTTACAGAAG TGAAAAAAGG CATTGCCTTG GTGGGTGGTG TTACCAGCTG 
7861 ATTTTTGGTT GTCCTGCAAG GAGGTCTGGG GACTGGCTGC TCTGTCTCTG TCTGTATGAG 
7321 TGAGGGAAGT CTGGGGAGCA GATTCCCTAA CCTTCAGCCT GGCCTGGTTC CTGAGTGAAC 
7981 CCAGCCTCTC TGGTCCTAGT AGCTTTTTCC AAACAGGAAT CTGAGTGGTG ACAGGGAACA 
8041 AGTACCAGCC CATTGCTTAA GTGCCAGGGT TAGTGAGGGC AGGAAGCTGC CATAGCTGGG 
8101 ATTAGTAGTT GTATTGGATG TAGGAAGTCC TATCCTGGGA CAGCTAATCC TTAATGCTTC 
8161 ACTGGAGATT TTCAATGAGA AATTTATCCC ACGGCCCATA TGGCCCCATC CTTTTGTCTC 
8221 CAACAGCCAA GTATTTTCCA TTAGAGGAGA CTTCCTGTAC ACTTGATGGA TGCTCATTCC 
8281 AAGGTGACTT GGGGCAGTCA GTACAGACTT GGGATGACCT CTGACAGCCT AACCTCTCCC 
8341 CAACAAGGGC CCTCTATGTT TGCTATGTAA TGTAATGTCA GACATTGTCA GGAGTGTCCG 
8401 CAGCACAGCC TGCCCAGTGT GAGGGCTCTC ATAGGTTTCC CACTGTCTTA TCTACACAGG 
8461 GATAACGAGG AGGTAAGCTG CAGTTCCCAG TCTCACTTCA CAGAGGAAGA GATAACCCCA 
8521 TCCCAGGTCA TGTAGCCAGC AGTGGAAAGA ATGAGGATTT GAACTCAGGT CTTCCAAGTC 
858 1 CCATTGATAG CATCTCCTCA CAAGTCCCTT GCCACCCTCA CGATGCCTTA GACACTTGCC 
8641 TGCCCTTTAT ACTAAGGAGA TGCAGGTACA AGGGGTTTAC CCATGTAGCA GCTGAGGCAG 
8701 CTGGGGATAG ATACCAGCAG CAGGCCTGAT GTCACCACTC TAACTCCAGC ATCCCCAGTC 
8761 TGTGTTCCTG GAGTGTGAAA ATCCCTACTT AACAAGATTG TGCAACAGTC CTTGGCTCTG 
8821 TGACCCATAG CTGGAAACAG GATTCTCATT GATTTGTGGA ACATGGTGGC AGCCAGCCAA 
8881 AAAGAGGGTC TGCATACAGA AGACACGTGT GGCAAGGCCA CAGCAGACTC TGACTACCTT 
8941 AGCTTACAGA ATTACAAGGT CATAATGTCC TCTGCTTTGG TCACCTCATG TTAAGGACAG 
9001 GCCCTAATGA AGATGGGGCA GAAGACTGAA GGAATGGCCA ACCAATAACT GGCCCAACTT 
9061 GAGACCCATC CTACAGGCAA GCATCAATTC CTGACACTAC TAATGATACT CTGTTATGCT 
9121 TGCAGACAGA AGCCTAGCAT AACTATCCTC CGAGAGGTCC ACCCAGCAAC TGACTGAAAC 
9181 AGAAAAAGAT ATCCACAGGC AAACAGTGGA TGGAGGTCAG GGACTATTAT GGGAGAGCTG 
9241 TGGGAAGGAT TAAAAACCCT GAftGGGGATA GGAACCCCAC AGGAAGACCA ACAGAGTCAA 

93 01 CTAAGAGACC TGTGGGAGCT CTCAGAGACT GAGCCACCAA CCAAAGAGCA TACACAGGCC 
9361 GGTCCGAGGC ACCTGGCACG TGTGAAGCAG ACATGCAGCT CAGTCTCCAT GTAGGTCCTC 

94 21 CAATAAGCGG TAGCCTGACT GCAGTATCCA ATCCCCAAC^^ GGGCTGCATA GTCTGGCCTC 
94 81 AGTGGGGGAG GATGCCCCTA ATCCTGCAGA GACTTGATGA GTGGAGAGCT ATCCAGGGGG 
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Figure 5 (continuedl: 



9541 AACCCACCCT CTCTGAGAAG GGAATGGGGA TGGGGGAGGG ACTCTGTGAA GAGGGGACAA 

96 01 GGACAAACAA GAACCTCAAA TAGGTCAGGC CCTAAAGGCT TGCTAAGTAG CAGTGGCCCA 

5661 GCTCTGTCCT GTTCCTCAGC CCAAGGCTCA GCTCCCACCT GTTTCTGTGT TTTTCTGGCT 

9721 TTTCATGGGC CTAGGACTTG GTGACCAGTT CAAACAATGG GGCCTGTGGA AGACACAATA 

9781 TACAAGACTA G6GACATTCC TGTTCTGCTG ACTATCCATA GCCTGATGTA GGTGGAAGGA 

9841 CCCAATCACT GQATTTCTAC CCTTGCACAA CCTTGACAGC TGAGGGCCTC TCAGAAACCT 

9901 ATTTCTTCCA CTGAAAAATG AGACTCTCAA ATGAACGTCG TGACAATCAT CAGGCTTATT^ 

9961 AAAGAGGTGT ATCTAACCTG AATGGCAAGC AGACAGCAGG CAAATGTCTG TATCAACCTC 

10021 TAGGAAGGAC AAGAACTGCT CACTGCT6CC CCCCAGGAGG CCATTTGCTG AAACAGCTGC 

10081 TCTCCTGCTG GTGCACAGGC CCTGCCTTCT CATTGCAGCC ACAGCCCCTT CCTGTCTGAA 

10141 CCTCCTGTCA GGTCACTGGG AAACAGATCA AGATGGAACA GGACAGCTCC TGATGGTAAA 

10201 TAAAAAACAG TGGTCATGGC TATTCATAGG GGTTTATGCT TCTTCAGTCC ACACTGTGAA 

102 61 GAGCTGTGGG CATGAACCAC AGTGTTCGAG GTAGAGTTGG GGTTCTGAAA TTCACAGTGG 
10321 GGTGAGCTCA GTAAATGTGA GCTGGAGGTC ACTCGTGAGA CACACAGTCC TGCTGCTTCT 

103 81 GTTCCCAATA TCCTGAGGAG ACGACACATC TACTTTGTTC AGAGGCCACA GTCTAGTTGA 
10441 CCTGAGAGTT ACCAGTTTCT TATTTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG 
105 01 TGTTGTTCGT GTGTGAGTGC AGGTGCACAT ATGATAGCGT ACACGTTGAG GTCAGAGGAT 
10561 AACTATCAGG CGTTGTCCCC TCCTACTTTT CCTCGGACTC TGGAGAACAA ACATGGGTCC 
10621 TTATTCCAGG GGAGCAAGTC GCTGTTGGCT GACACATCTT GCTCACATAC ATTTTACCTA 
10681 GACAATGGAG CCTCCATCAG AGTATTACTT TAGCTCCTCA CCGATGGCAA TGGACCACCT 
10741 CTCTACCCAC ATAGGAGTTG GGTCTCCACA CACCCCCACA CCCCCTTCAC CAAAACGTTT 
10801 TCAGTTACTT TATCTGGTAA AGTTCATCAG AGAATGAAGC CAGTATTAAG AACATGGAAT 
10861 CATTTGGGAA CCTGGATCTA GCAATACCCC ACCCTAGATQ GAGTTGCTGA GTTTTCACCT 
10921 CAGATTATAA TTCCCCCCTA GCTTCTATGG TTTATTCTGA AACCAGGGGA ACTCGATTCC 
10981 TCCCTTTGGA CCACAGACAT CCTGGCTTGT GAATTCACAT GTCATCTACT GCTAATCCAT 
11041 TGGTAGTATG TGGCTCACAG AGACACACTA CAGTCATGGC CAATGTCAAG GTAGGACAGA 
11101 TGTGAATCAT TCCCCCAGTC CTGCTGTTTT CATGACTAAC CCTCCTCAGC ACAGTGACCA 
11161 TGAACCTACT TTTCCCCTCC TTTTATTTTT AGAATTGCTG GAATTTTCTA TTTTGAGAAA 
11221 TAATAGCCTT GGGCAGCATT AAACAAAATC ATCTAGAAAG CTGGTTTAAA ATACAGATGG 
11261 TTGAGTCAGT GAAAGAGTGA GGAATGTCAT TATTGGCCCC TCACAGAGGC TGGCTCACTC 
11341 CAGCAGAGGT GGTTGAAGCT CTTGGACACG GGTCAGGTGC ATAGGAAAGG TNGTCTGGGA 
11401 CACTGAGAAC CACAATTGAA CAAACAGAAC TGTTGGCTTT TTTTTTTTTA AATGAGTTCT 
114 61 CAAAAAATGA CTGGCTAGCT TAGGCAAATA CTTCGAGCCA ACCCAACAGA ACATTCTTCC 
11521 ATTGATTCAT TCTGGATCTT CTTTCTAGAC AATACTGAAC TGACCCCTTG TTGGCAGTCT 
11581 CAAGTTTGAC AACATAGGGC TTTGAACTTG GCACAAGGTC CATCACTGTC ACCCAAGCAT 
11641 CCTGGGTGAC CTTTGGGTTG GAATATCTTG GCTAACCTTA GATATTTTCT TTGGAGTATC 
11701 TTTAGAACAT CCAGGAAATA GGGCTTGATT CTCATCCTGG GACCACAATA TAAGTCACCC 
11761 TAGAATCCCA GGAGATCGTG CAGAGAAACA AGGATCTCTC TCGTGTGCAT CCTTCTTCAA 
11821 AGCAGTGAGT AGTGACTCCA CTAAACTGAG TTCCCATCTG AGAGTCCACA GGAGGCTTTG 
11881 GGGCAAGAAG CAGAGGGAAG GCACTGTTTG TGTTGGTAAA GTTTTGACTC TAACAAATTT 
11941 GAAGACATAG ATGACATTGT GTCAGACTAA CAACAACCTA GACTCATGTG GGTTCTGTTT 
12001 AGGGATCAGA TTTTATTCAT CAATGACTTG TCTTAGTGTA TAGAGAAAGG CTTCCTACTG 
12061 GAGTGTAGGC TCAATAATGA CAGAAGAGAT AGCTATTTCC CCTAGGGACT GTGCTGCTCC 
12121 AAGTTTGGTG GAGAAAGGCA GTGGGGAACC TAGATGTGCT CTCTGGGGAG GGGGTCTGAA 
12181 GCTGGCTTCA TAGAAGGTGT GAAGTTTTGC TGAAACATCT AAACAGAATT ATAGCTTAGG 
12241 AAAGTGAGCA GGCAAGGCAG GGAATGTGTT GCATATGTAT ATGTACATGA ATATATTATG 
12301 TTATAGATAC ACACACATTT GAACCTCATT TGCAGATGAC AGAAAATAGG TTATTTTGCC 
123 61 TCTCTTAACT GCTAAGCACA ATGACTTCCA GTTCCATCCA TTTCCTGAAA TGCCACAATT 
12421 TCATTTTTCA TTGTGGCTGA ATAAAATTCC ATTGCAGACT GGGCCCTACT TCATCCACTC 
12481 CTGAGGGCAG GCATATCCCC TGGCTCCATT TCTTACCTAT TGTGAAGAGA AGTGCAACTG 
12541 TCTTGTTGAA AGGCAAGCGT GAGAQAGGCA GGCACTAATT GTGGGTTTTT GTTTCTTCTT 
12601 CCTGCTATGA CTCTCCATTT GTCAGAACCA AAGATCGATA AAAGCCGCCA CCATGAAAGC 
12661 CATCTTAATC CCATTTTTAT CTCTTCTGAT TCCGTTAACC CCGCAATCTG CATTCGCTCA 
12721 GAGTGAGCCG GAGCTGAAGC TGGAAAGTGT GGTGATTGTC AGTCGTCATG GTGTGCGTGC 
12781 TCCAACCAAG GCCACGCAAC TGATGCAGGA TGTCACCCCA GACGCATGGC CAACCTGGCC 
12841 GGTAAAACTG GGTTGGCTGA CACCGCGCGG TGGTGAGCTA ATCGCCTATC TCGGACATTA 
12901 CCAACGCCAG CGTCTGGTAG CCGACGGATT GCTGGCGAAA AAGGGCTGCC CGCAGTCTGG 
12961 TCAGGTCGCG ATTATTGCTG ATGTCGACGA GCGTACCCGT AAAACAGGCG AAGCCTTCGC 
13 021 CGCCGGGCrCG GCACCTGACT GTGCAATAAC CGTACATACC CAGGCAGATA CGTCCAGTCC 
13081 CGATCCGTTA TTTAATCCTC TAAAAACTGG CGTTTGCCAA CTGGATAACG CGAACGTGAC 
13141 TGACGCGATC CTCAGCAGGG CAGGAGGGTC AATTGCTGAC TTTACCGGGC ATCGGCAAAC 
13201 GGCGTTTCGC GAACTGGAAC GGGTGCTTAA TTTTCCGCAA TCAAACTTGT GCCTTAAACG 
13261 TGAGAAACAG GACGAAAGCT GTTCATTAAC GCAGGCATTA CCATCGGAAC TCAAGGTGAG 
13321 CGCCGACAAT GTCTCATTAA CCGGTGCGGT AAGCCTCGCA TCAATGCTGA CGGAGATATT 
13381 TCTCCTGCAA CAAGCACAGG GAATGCCGGA GCCGGGGTGG GGAAGGATCA CCGATTCACA 
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Figure 5 (continued^): 

13441 CCAGTGGAAC ACCTTGCTAA GTTTGCATAA CGCGCAATTT TATTTGCTAC AACGCACGCC 
13501 AGAGGTTGCC CGCAGCCGCG CCACCCCGTT ATTAGATTTG ATCAAGACAG CGTTGACGCC 

13 561 CCATCCACCG CAAAAACAGG CGTATGGTGT GACATTACCC ACTTCAGTGC TGTTTATCGC 
13621 CGGACACGAT ACTAATCTGG CAAATCTCGG CGGCGCACTG GAGCTCAACT GGACGCTTCC 
13681 CGGTCAGCCG GATAACACGC CGCCAGGTGG TGAACTGGTG TTTGAACGCT GGCGTCGGCT 
13741 AAGCGATAAC AGCCAGTGGA TTCAGGTTTC GCTGGTCTTC CAGACTTTAC AGCAGATGCG 
13801 TGATAAAACG CCGCTGTCAT TAAATACGCC GCCCGGAGAG GTGAAACTGA CCCTGGCAGG 
138 61 ATGTGAAGAG CGAAATGCGC AGGGCATGTG TTCGTTGGCA GGTTTTACGC AAATCGTGAA' 
13921 TGAAGCACGC ATACCCGCTT GCAGTTTGTA AGGTACCCGG GGATCACAAC TTGCCCTCTG 
13981 AAGAGGAAGA ACAGAAGGAT GCCACAACTC TCCTGCTGGC tactctccag tggtttcatc 
14041 TTACTTCTGA TGGCATTTCC CTCTAGAAAG TGCTACTATC ATCCACACAT TTCTACCTGA 
14101 GACCACCCAA AGGACCCTCC CAAATTCTCT TCCTCTCTGA GTAGTCTCCA CACCTGTTAC 
14161 CACCATCCCA GAATTAAAAT CCTAACTGCA CTCTGGCGTG TGACTTGCCT CAGTCCTTGC 
14221 AATAAGAGTT GTTGGCAGTG CCAGGCGTGG TGGCGCACGC CTTTAATTCC AGCACTTGGG 
142 Bl AGGCAGAGGC AGGCGGATTT CTGAGTTCGA GGCCAGCCTG GTCTACAGAG TGAGTTCCAG 
14341 GACAGCCAGG GCTATACAGA GAAACCCTGT GTCGAAAAAC CAAAAAAAAA AAAAAAAGTT 
144 01 GTTGGCAGAG TGTGGGTTAT ATACCAGGTG GAGATTTCAA ATGAGTGGCT GAAGCTGTAG 
14461 CCAGAAGGAA CTTAGAGGAT AGCTCATAAC TTAAAAAGAA ATGTAGAGAG TAGCAGAAAC 
14521 ATTGAGAGAG TGGGCACACA GCCACT6TGT GAATGTGGCA GAACACAATC CAGCCAGCTA 
14581 TACATGCATA AGTGTATATT GGCGCCATCC TGACTGATGA GACACAGGAA AACAGATAGA 
14641 CGGGGTTAGG TGGCCATGGC CTTTCCTGCC TGCCTCTTCC TAAGGGTCAT CfCAAGACCT 
14701 TATGCTCTCT TAACTCTTCC ATTGCTACTT AGCTTCTAGA TATCACCTCC AGATTAGTCT 
14761 CCTTGGGTAC ATCAGTGATC CTGGTGATAT CCAGGGCTTC CTGATTCCAT CTTTGTCATA 

14 821 GAGGCTGCAA CTAAAGAGGT CTTCTTAATA CTTCACACCC TGATGCC/^ AGGAAGACAC 
14881 AGAAGTTCAC AGAGGTGAAG TGATTCATGT AGGACATACA GTGAGCAA6C ATCAGGGTCC 
14941 GGATTATCTG ACTCTACTCT AACTTTTATG TAAATGTGCT TTATGCCATT AACACTGTCA 
15001 TTCCTGTGCT TCAGCTCTGG GAGACTCCCA AGCACTCTTA GGCACAAGCC ACAATTAAGG 
15061 GACTCTGACA CTCTGCATTG ATTAATTAGC ATGGTGGTCT CTATGTTTCC AGATTCATGA 
15121 TTGTTTCACT TTCCATATAG GCTATGAAGG GTGTGAGGAA ATTTTTTGGG GACAGAATTG 
15181 GAGGCAATCC ACCTCTCTCA GGAAGCCTCT ATCTGGAAAA GCTTACAACT CAGGGACAGT 
15241 AACTGTAGGC CCAGTCCTTG GTGTCCAAAA TGGGTTTTAT GGTTTGAATC TGCAAAGCCT 
15301 TCCATGTGCT CAAAGGTTTG AACATGGAGC CTCCTCCTGG TAACACTGTA TTGGAGGCTT 
15361 TTGAGACTGG ATGCTCTTTG GTCCCATGTT TTGCTACATC ATCTGTCAAG ATATGACCCA 
15421 GGCATGCTAC CAGCTACCAC AGACTATGCC TCTCCAGCTT TCATGTTCTC CCCACCATGA 
15481 TAGACTTGTA TCTCCTAAAA ATGGAATCAA AGCAAACTTT TCCTGCATTA AGTTTTTTTT 
15541 TTTCTGTTAA GTGTTTGGTC ACAGGGACAA GAAAACACTC AATACAGATA ATTAGTACCA 
15601 GAGTTGAGGT TCATTGCTCT AGCAAGTTGG ATCAAATTTT TAGGGCTTTG GAACTGATTT 
15661 ATAAGAGACA TGTAGAAGAG TCTGAAGCTG TGGGCTACAG AAGTGTCACC AGTTTTTAAG 
15721 AATAGTTTAA TACACCATGG GAATTGTGAA AATCAGAATG CTCACACAAA GGCAGACAGG 
15781 AAAACGTGAG CATGTGGCGT GTGAGAGGGC ATAAGAAGGA ACCTAGGGGG AAATGAGCTA 
15841 GAAGCCATTC GGCTACGTTA GGGAACGTGT GTGGCTGTGC TTGGCCCATG CCCTGGCAAT 
15901 CTGAATGAGG CCAAATTTTA AAGGAGTGGA CTAACTCGAT TGTCAGAGAA AATATCAAGA 
15961 CAGACCACCA CTCAGGCTAT GCCGTGTTTG TGACCGACCA GCTACTCTTA GCCAGCTCTA 
16021 TTGTGAAATT CCAGAGCAAT TATCAGAGCA TGAAGATACA TACAGTTTAG TGAAGTAAGG 
16081 GGTGTGGGTC CCTAAGTGGA TGGTGCATAA ATCTATGTAG GTGATGCCTA AGTGACACTT 
16141 GATAATCCAA AATATCAGCA ATGTGGAATG TCTTCCAAGG AGACCTGTAG ACACACATTT 
16201 TAGAACTTTG CTCATGGCTG TAATAAATAG CTAGCTAGAA ATCATTTCCT GAAGAGGTTA 
16261 GTCTGAGTTA CGGTTCCAGG GCAAACATTC AGTGATGGCA AGGAAGGCAT TGCAGTCAGG 
16321 AGCCAAAGGT CAGCTGGTCA CATTGCATCA AGAGTAGAGA GTCAGAGTGT GAGTAGAAAG 
16381 AGGATACAGG TTATAAAACC TCACTGTCCA CTCTCAGCAA TCCATTTTCT CCTAAAAGGC 
16441 TTTACCTTCT AAAGATTTTA GTCTTCAAAA CCAGTACCAG TAGCCTGGGA ACAAAAGTTG 
16501 AAACAAATGA GCCTTTGTGG GGCATTTCAC ACTTAAAACA GGGCATCACC TAGGAGGAGC 
16561 CCTGTGTGCA GTAGGAAGTG TGGCCTCTGT GTCAGGAATG CTCAGGCTAA TAAGGGGTCC 
16 621 TCTATCTGAG GGACCCTATG AAGATTCAAC AAGTAGTTGT GAGAATTCCC TGTAAATGGA 
16681 TGCTACCAAT TTGACATTTG TAGACCTGCT ATTGTGTGCT TCTTTATTGG GCTCTCCCAT 
16741 CTCCCAACTT TCCAACCCAT ATTCCACATT AATCCCTTCC ACCACCATGC AACACTAGGT 
16801 AGGAGAGAAG GAAGGTTAGA AGAGAAAGTG GGTATAGATC TATTTAGACT ACTTCCTGCT 
16 861 GATTAGGGGC AAGTCCAATC GTCATTGTCA GGATACCTCC AACCAGCAAC CAGCAAACCA 
16 921 GCAAATCAGA AACAGCAAAA GCAGCCAACA AGGCAGCACT AACCAGCAGG ATTGGGGTCG 
16981 GTAGCGTGGG AGCAGTCACT ACTGGTCTTC TCATGGCTTT GGCATTAATA CTCTCTCAAG 
17041 AAATTCCGTA ATTTTTTCCC CACCACCTGA AATTCCGTAA TTTTAAATGC AAACTATCTA 
17101 CAGCTGGCAA AAATCACATC TCTCCTAGAG CACAAGACAA ATCATAGTTA CTGGCTATTT 
17161 GCAATCTGAA GCATCTCAAT ATCCCACACC TGGGATTAAA ACAAAAACAT ATTCACATCA 
17221 CATA\CTGTT TTTTTTTTCC AATTTTTTAT TAGGTATTTT CTTTATTTAC ATTTCAAATG 
17281 CTATCCCGAA AGTCCCCTAT ACCCTCCCAC CTCCCTGCTC CCCTACACAC CCACTCCCAC 
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Figgre 5 (continued): 

17341 TTTTTGACCC TGGAGTTCCC CGGTACTGGG 
17401 CTCTTCCCAG TGATGGCCGA CTAAGCCATC 
17461 CTCTGGGGGT ACTAGTTAGT TCATATTGTT 
17521 GCTCCTTGGG TACTTTGTCT AGCTCCTCCA 
17581 ACTGTGAGCA TCCACTTCTG TATTTGACAG 
17641 TATCAGGGTC CTTTCAGCAA AACCTTGCTG 
17701 TGATTATGGG ATGGATCCAC TAGTTCTAGA 
17761 TTGTTCCCTT TAGTGAGGGT TAATTGCGCG 
17821 TGTGTGAAAT TGTTATCCGC TCACAATTCC 
17881 TAAAGCCTGG GGTGCCTAAT GAGTGAGCTA 
17941 CGCTTTCCAG TCGGGAAACC TGTCGTGCCA 
18001 GAGAGGCGGT TTGCGTATTG GGCGCTCTTC 
18061 GGTCGTTCGG CTGCGGCGAG CGGTATCAGC 
18121 AGAATCAGGG GATAACGCAG GAAAGAACAT 
1B181 CCGTAAAAAG GCCGCGTTGC TGGCGTTTTT 
18241 CAAAAATCGA CGCTCAAGTC AGAGGTGGCG 
18301 GTTTCCCCCT GGAAGCTCCC TCGTGCGCTC 
18361 CCTGTCCGCC TTTCTCCCTT CGGGAAGCGT 
18421 TCTCAGTTCG GTGTAGGTCG TTCGCTCCAA 
18481 GCCCGACCGC TGCGCCTTAT CC3GGTAACTA 
18541 CTTATCGCCA CTGGCAGCAG CCACTGGTAA 
18601 TGCTACAGAG TTCTTGAAGT GGTGGCCTAA 
18661 TATCTGCGCT CTGCTGAAGC CAGTTACCTT 
18721 CAAACAAACC ACCGCTGGTA GCGGTGGTTT 
18781 AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT 
18841 CGAAAACTCA CGTTAAGGGA TTTTGGTCAT 
18901 CCTTTTAAAT TAAAAATGAA. GTTTTAAATC 
18961 TGACAGTTAC CAATGCTTAA TCAGTGAGGC 
19021 ATCCATAGTT GCCTGACTCC CCGTCGTGTA 
19081 TGGCCCCAGT GCTGCAATGA TACCGCGAGA 
19141 AA.TAAACCAG CCAGCCGGAA GGGCCGAGCG 
19201 CATCCAGTCT ATTAATTGTT GCCGGGAAGC 
19261 GCX3CAACGTT GTTGCCATTG CTACAGGCAT 
19321 TTCATTCAGC TCCGGTTCCC AACGATCAAG 
19381 A7UVAGCGGTT AGCTCCTTCG GTCCTCCGAT 
19441 ATCACTCATG GTTATGGCAG CACTGCATAA 
19501 CTTTTCTGTG ACTGGTGAGT ACTCAACCAA 
19561 GAGTTGCTCT TGCCCGGCGT CAATACGGGA 
19621 AGTGCTCATC ATTGGAAAAC GTTCTTCGGG 
19681 GAGATCCAGT TCGATGTAAC CCACTCGTGC 
19741 CACCAGCGTT TCTGGGTGAG CAAA7VACAGG 
19S01 GGCGACACGG AAATGTTGAA TACTCATACT 
19861 TCAGGGTTAT TGTCTCATGA GCGGATACAT 
19921 AGGGGTTCCG CGCACATTTC CCCGAAAAGT 
19981 TTAAAATTCG CGTTAAATTT TTGTTAAATC 
20041 GGCAAAATCC CTTATAAATC AAAAGAATAG 
20101 TGGAACAAGA GTCCACTATT AAAGAACGTG 
20161 TATCAGGGCG ATGGCCCACT ACGTGAACCA 
20221 TGCCGTAAAG CACTAAATCG GAACCCTAAA 
20281 AAGCCGGCGA ACGTGGCGAG AAAGGAAGGG 
20341 CTGGCAAGTG TAGCGGTCAC GCTGCGCGTA 
20401 CTACAGGGCG CGTCCCATTC GCCATTCAGG 
20461 CGGGCCTCTT CGCTATTACG CCAGCTGGCG 
20521 TGGGTAACGC CAGGGTTTTC CCAGTCACGA 
20581 TAATACGACT CACTATAGGG CGAATTGGGT 



GCATATAAAG TTTGCAAGAC CAAGGGGCCT 
TTCTGCTACA TATGCAGATA GAGACACGAG 
GTTCCACCTA TAGGGTCGCA GACCCCTTCA 
CTGGGGGCTC TGTGTTTTAT CTAATAGATG 
GCACTGGCCT AGCGTCACAT GAGCCAGCTA 
GCATGTGCAA TAGTGTCTGC GTTTGGTGGT 
GCGGCCGCCA CCGCGGTGGA GCTCCAGCTT 
CTTGGCGTAA TCATGGTCAT AGCTGTTTCC 
ACACAACATA CGAGCCGGAA GCATAAAGTG 
ACTCACATTA ATTGCGTTGC GCTCACTGCC 
GCTGCATTAA TGAATCGGCC AACGCGCGGG 
CGCTTCCTCG CTCACTGACT CGCTGCGCTC 
TCACTCAAAG GCGGTAATAC GGTTATCCAC 
GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA 
CCATAGGCTC CGCCCCCCTG ACGAGCATCA 
AAACCCGACA GGACTATAAA GATACCAGGC 
TCCTGTTCCG ACCCTGCCGC TTACCGGATA 
GGCGCTTTCT CATAGCTCAC GCTGTAGGTA 
GCTGGGCTGT GTGCACGAAC CCCCCGTTCA 
TCGTCTTQAG TCCAACCCGG TAAGACACGA 
CAGGATTAGC AG^^CGAGGT ATGTAGGCGG 
CTACGGCTAC ACTAGAAGGA CAGTATTTGG 
CGGAAAAAGA GTTGGTA6CT CTTGATCCGG 
TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 
CTTTTCTACG GGGTCTGACG CTCAGTGGAA 
GAGATTATCA AAAAGGATCT TCACCTAGAT 
AATCTAAAGT ATATATGAGT AAACTTGGTC 
ACCTATCTCA GCGATCTGTC TATTTCGTTC 
GATAACTACG ATACGGGAGG GCTTACCATC 
CCCACGCTCA CCGGCTCCAG ATTTATCAGC 
CAGAAGTGGT CCTGCAACTT TATCCGCCTC 
TAGAGTAAGT AGTTCGCCAG TTAATAGTTT 
CGTGGTGTCA CGCTCGTCGT TTGGTATGGC 
GCGAGTTACA TGATCCCCCA TGTTGTGCAA 
CGTTGTCAGA AGTAAGTTGG CCGCAGTGTT 
TTCTCTTACT GTCATGCCAT CCGTAAGATG 
GTCATTCTGA GAATAGTGTA TGCGGCGACC 
TAATACCGCG CCACATAGCA GAACTTTAAA 
GCGAAAACTC TCAAGGATCT TACCGCTGTT 
ACCCAACTGA TCTTCAGCAT CTTTTACTTT 
AAGGCAAAAT GCCGCAAAAA AGGGAATAAG 
CTTCCTTTTT CAATATTATT GAAGCATTTA 
ATTTGAA.TGT ATTTAGAAAA ATAAACAAAT 
GCCACCTAAA TTGTAAGCGT TAATATTTTG 
AGCTCATTXT TTAACCAATA GGCCGAAATC 
ACCGAGATAG GGTTGAGTGT TGrTCCAGTT 
GACTCCAACG TCAAAGGGCG AAAAACCGTC 
TCACCCTAAT CAAGTTTTTT GGGGTCGAGG 
GGGAGCCCCC GATTTAGAGC TTGACGGGGA 
AAGAAAGCGA AAGGAGCGGG CGCTAGGGCG 
ACCACCACAC CCGCCGCGCT TAATGCGCCG 
CTGCGCAACT GTTGGGAAGG GCGATCGGTG 
AAAGGGGGAT GTGCTGCAAG GCGATTAAGT 
CGTTGTAAAA CGACGGCCAG TGAGCGCGCG 
ACCGGGCCCC CCC 
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Figure 18: Nucleic acid sequence of the known segment of the RlS/appaH-intron plasmid, 
including the vector sequences of pBLCATS (SEP ID NO:2). 



LOCUS R15/appa+intron 6708 bp DNA SYN 15-APR-2000 

DEFINITION R15/appa+intron transgene with vector cut 13543 to 4954 
ACCESSION RlS/appa+intron 
REFERENCE 1 (bases 1 to 6708) ) 
SOURCE synthetic construct. 

ORGANISM synthetic construct 
artificial sequence, 
salivary proline -rich protein, acid glucose -1 -phosphatase; appA 
gene; periplasmic phosphoanhydride phosphohydrolase; artificial 
sequence; 

Golovan, Forsberg, C.W. , Phillips, J. 

Unpublished, 



KEYWORDS 



AUTHORS 
JOURNAL 



DEFINITION 
ACCESSION 
VERSION 
SOURCE 

ORGANISM 

Mammalia; 

Rattus , 

REFERENCE 
AUTHORS 
TITLE 
encoding 

JOURNAL 
MEDLINE 
FEATURES 

source 



Rat salivary proline -rich protein (RP15) gene. 

M64793 M36414 

M64793. 1 GI :206711 

Rat (Sprague-Dawley) liver DNA. 

Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 

1 {bases 1 to 1748) 
Lin,H.H, and Ann^D.K, 

Molecular characterization of rat multigene family 

proline -rich proteins 
Genomics 10, 102-113 (1991) 
91257817 

Location/Qualifiers 
1- .1748 

/organism* "Rattus norvegicus" 
/ s t ra ina= " Spr ague - Dawl ey " 
/db_xref=s"taxon: 10116" 
/ 1 i s sue_^type« " 1 i ver " 

/tissue_libas"cosmid genomic library" 
1802-1810 

/function*" consensus sequence for initiation in 

higher eukaryotes 



misc feature 



FEATURES Location/Qualifiers 

DEFINITION E- coli periplasmic phosphoanhydride phosphohydrolase (appA) 
gene, 

ACCESSION M58708 L03370 L03371 L03372 L03373 L03374 L03375 

VERSION M58708.1 GJ: 145283 

SOURCE Escherichia coli DNA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria; gamma subdivision; 
Enterobacteriaceae ; 

Escherichia . 

REFERENCE 1 (bases 1811.. 3109) 

AUTHORS Dassa, J. , Marck,C. and Boquet,P.L. 
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TITLE 



JOURNAL 
MEDLINE 



The complete nucleotide sequence of the Escherichia coli 
gene appA reveals significant homology between pH 2.5 
acid phosphatase and glucose -1-phosphatase 

J. Bacterid. 172 (9), 5497-5500 (1990) 

90368616 



FEATURES 

Source 



sig_j)eptide 
/gene="appA 
CDS 



11 



Loca t ion/ Qual i f ier s 
1811, .3109 

/organism=" Escherichia coli" 
/ db_xref = " taxon : 5 62 " 
1811. . 1876 

1811 . . 3109 
/gene= "appA" 

/standard_name="acid phosphatase/phytase" 
/ trans l_^table= 11 

/product^ "periplasmic phosphoanhydride 
phosphohydrolase " 
/protein_id= " AAA72 0 86.1" 
/db xref="GI: 145285" 



/ translations "MKAILIPFLSIjLIPLTPQSAFAQSEPELKLESWIVSRHGVRAP 
TKATQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYLGHYQRQRLVADGLLAKKGCPQS 
GQVAIIADVDERTRKTGEAPAAGtAPDCAITVHTQADTSSPDPLFNPLKTGVCQLDNA 
IT^miAILSRAGGSIADFTGHRQTMRELERVLNFPQSNLCLKREKQDESCSLTQALPS 

ELKVSADWSLTGAVSIoASMLTEIFLLQQAQGMPEPGWGRITDSHQWNTLLSLHNAQF 

YLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFIAGHDTNLANLGG 

ALELNWTLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSLVFQTLQQMRDKTPLSLNT 

PPGEVKLTLAGCEERNAQGMCSLAGFTQIVNEARIPACSL" 
matjpeptide 1877 3106 

/ g ene = " app A " 

/product= "periplasmic phosphoanhydride 
phosphohydrolase " 



mutation 



mutation 



mutation 



replace (1817 • . 1819, "gcg changed to gcc") 
/ gene 5= " app A" 

/standard_name-"A3 mutant" 

/note=" created by site directed mutagenesis" 
/phenotypess" silent mutation" 
replace (3092 3094 , " ccg changed to ccc") 
/ gene = " appA " 

/standard_name=: " P428 mutant" 

/notes=" created by site directed mutagenesis" 

/phenotype=" silent mutation " 

replace (3095 . .3097, " gcg changed to get") 

/gene="appA" 

/standard_name= " A4 2 9 mutant" 

/note="created by site directed mutagenesis" 
/phenotype=" silent mutation " 
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DEFINITION Plasmid pBLCAT3 (bases 3109 to 6708) 
ACCESSION X644 05 

X64409,l GI:S8163 
synthetic construct . 
synthetic construct 
artificial sequence. 

1 (bases 3109 to 6708) 
Luckow, B . H . R . 
Direct Submission 

Submitted (06 -FEB- 1992) B.H.R. Luckow, German Cancer Res 
Center^ Im Neuenheimer Feld 280, W-6900 Heidelberg, FRG 

2 (bases 3109 to 6708) 

Luckow, B. and Schutz,G. — 
CAT constructions with multiple unique restriction sites 



VERSION 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 



for 

regulatory 

JOURNAL 
MEDLINE 
COMMENT 
experiments 

FEATURES 

source 



the functional analysis of eukaryotic promoters and 
elements 

Nucleic Acids Res. 15 (13), 5490 (1987) 
87260024 

Promoterless CAT vector for transient trans fection 

with eukaryotic cells. Allows the analysis of foreign 
promoters and enhancers . 

Location/Oualif iers 

3109 to 6116 

/organisms "synthetic construct'* 
/db xref="taxon: 3263 0" 



SV40 t intron 3197,. 3810 

/note="SV4 0 signals" 

polyA_signal 3807.. 4047 

/note='*SV4 0 signals" 

CDS complement (5244. .6104) 

/codon_start=l 
/ trans l_table=ll 
/genets "Amp" 

/product* "beta- lactamase" 
/protein_id=:"CAA45753 . 1" 
/db xref*"GI: 58165" 



BASE COUNT 1916 a 147 9 c 1515 g 

ORIGIN 

1 GGATCCCCTT TGCTATGTAG TTTTTAATGG 
61 GAGAGTCCTG TTTGGTTTAA GCAACCTCTG 
121 CTCTTTGTTT CTAGCATAAC CAAAAGATTT 
181 ATAGGTCTAA TAACCCCGAA AATATTACCA 
241 CATGTAGTAT CCATAGTCCA TCAATGAGAG 
301 TGGAAAAGAC ATGACAACAT TCACAGGCAC 
361 TATTTCACTA AACTAGGTTT ATCTATTTTG 
421 AGGTCAACAG TGCCACATAT CCTTTACTTA 
4 81 TATCCTGGTT AGAGAGTGCT TAAAATAAGT 
541 TTAACAATTA AGACAGTATT TATTTAAAGC 
601 TGGGAAGAAA CCATTTGGTG AACAATATTT 
661 TAAAACATAT GTTTGACCA-G CCCTTCTTTT 
721 GATTCTCTTT GGGTGGCTGC AAATTGTCCA 
7B1 GAGTCTCACA AJLATGA-AAAG GAAATATATT 
841 CACAAATTAA"^.GAAA.ACCTG TGGTGAATGA 
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798 t 

AAATTACAAC CCATAGTGTG TTGATAAATA 
TTTCTCATAA ACTCCATAAA AACAGGAATA 
AGTGAATTGA AAACAATGTT CCCTTAGAGT 
TGATACTGAG CATTTGTAAG TATCTCATAG 
AGACATTTAA CATGATTTTC ATTAATCAGG 
TGCACAGAAC ATAGTGGTCC A'CCTTGCACA 
TTGCTTTCTC TAACATCTCT GCAATGAAGC 
ACCTAAGGAA CACAAAAAAT TTTCTACATA 
TTTCCAAGAA TGGAAAAGAA ATGTTCTGAC 
AAGAAATATG AGGCA.CACAA GAAAATATTT 
CAAATAAAAA TAGACAAACA TAGTTAATTG 
CAATAGGCTT AATGTGAATA AAATGTTAAA 
CGAATAAGAC AAAATATAAA AATAAGGACT 
CAGAAAGAGA ATCTTGAGAG AATGTGTTGT 
CATCCTGAGG CCTGAGCTAT TACTGACATT 
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Figure 18 (continued^: 

901 TAAGATAAAG GTAACTGTAT ACATTTGTCC CATTGAGGGG ACAAGATU^GC TGCTCTCATG 
961 TTCAGCTCTA TAATTCTTGC CTTAAACAAC TTAAATAGAA TGATTTAAAA TATGGAGCTG 
1021 TCCATGGACC TTTGAAATAT AAAATAGTCA AGCAACTTAT CAAGGAATTA CAGATTCCTT 
1081 GATACTAACA CAGGTAAATC CCACACGTGT TTTGAGACTA CATTTGCTGG GATTTTATTG 
1141 ATGTAATAGG TCACATGTTT TTCGGGCCAA TGTTGCTGTT ATTCGGTTAC TTCAAGAGAA 
1201 TAGTGGCAAC TGATGCTATG TATTCTAGGG GTTTGAAGTG ATGTTTCATG ATTGAAATTT 
1261 GTAAAAGAAT AACATCATCA TTCTTAACAA TAGAACATAT AAAGTCACAC AGAAGTGACA 
1321 GTGTTTAAGC TGTACTATTG ATCAAAGAAA TTTATTACCT TCAGTTTCAA TGGAAATAAT 
1381 TACTGATAAT ACAAACATGT GTGAACACAC ACTAATCCTA TCCAAATGCA CAGTGATACA 
1441 CAGAAAATAT TAGCAAGTAG AATGCAATAT TTATATAACG ATTGTATTTA TCAATCAATT 
1501 GTATGTATCA ATATATGGGC TATTTTCTTA CACATGATTT TATTCAAATT TACTCTAATC 
1561 ATTGTTGAAC CATTTAGAAA AGGCATACTG GCAACTTTTC CTTACCTCAT CCAGCTGGGC 
1621 AAAAGTCCCA GTGTGGAGTA AAGGATGCAA GATTTCCTGC TCTGTTAAGT ATAAAATAAT 
1681 AGTATGAATT CAAAGGTGCC ATTCTTCTGC TTCTAGTTAT AAAGGCAGTG CTTGCTTCTT 
1741 CCAGCACAGA TCTGGATCTC GAGGAGCTTG GCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
1801 AGCCGCCACC ATGAAAGCCA TCTTAATCCC ATTTTTATCT CTTCTGATTC CGTTAACCCC 
1861 GCAATCTGCA TTCGCTCAGA GTGAGCCGGA GCTGAAGCTG GAAAGTGTGG TGATTGTCAG 
1921 TCGTCATGGT GTGCGTGCTC CAACCAAGGC CACGCAACTG ATGCAGGATG TCACCCCAGA 
1981 CGCATGGCCA ACCTGGCCGG TAAAACTGGG TTGGCTGACA CCGCGCGGTG GTGAGCTAAT 
2041 CGCCTATCTC GGACATTACC AACGCCAGCG TCTGGTAGCC GACGGATTGC TGGCGAAAAA 
2101 GGGCTGCCCG CAGTCTGGTC AGGTCGCGAT TATTGCTGAT GTCGACGAGC GTACCCGTAA 
2161 AACAGGCGAA GCCTTCGCCG CCGGGCTGGC ACCTGACTGT GCAATAACCG TACATACCCA 
2221 GGCAGATACG TCCAGTCCCG ATCCGTTATT TAATCCTCTA AAAACTGGCG TTTGCCAACT 
2281 GGATAACGCG AACGTGACTG ACGCGATCCT CAGCAGGGCA GGAGGGTCAA TTGCTGACTT 
2341 TACCGGGCAT CGGCAAACGG CGTTTCGCGA ACTGGAACGG GTGCTTAATT TTCCGCAATC 
2401 AAACTTGTGC CTTAAACGTG AGAAACAGGA CGAAAGCTGT TCATTAACGC AGGCATTACC 
2461 ATCGGAACTC AAGGTGAGCG CCGACAATGT CTCATTAACC GGTGCGGTAA GCCTCGCATC , 
2521 AATGCTGACG GAGATATTTC TCCTGCAACA AGCACAGGGA ATGCCGGAGC CGGGGTGGGG 
25 81 AAGGATCACC GATTCACACC AGTGGAACAC CTTGCTAAGT TTGCATAACG CGCAATTTTA 
2641 TTTGCTACAA CGCACGCCAG AGGTTGCCCG CAGCCGCGCC ACCCCGTTAT TAGATTTGAT 
27 01 CAAGACAGCG TTGACGCCCC ATCCACCGCA AAAACAGGCG TATGGTGTGA CATTACCCAC 
2 7 61 TTCAGTGCTG TTTATCGCCG GACACGATAC TAATCTGGCA AATCTCGGCG GCGCACTGGA 
2 821 GCTCAACTGG ACGCTTCCCG GTGAGCCGGA TAACACGCCG CCAGGTGGTG AACTGGTGTT 
2 8 81 TGAACGCTGG CGTCGGCTAA GCGATAACAG CCAGTGGATT CAGGTTTCGC TGGTCTTCCA 

2 941 GACTTTACAG CAGATGCGTG ATAAAACGCC GCTGTCATTA AATACGCCGC CCGGAGAGGT 

3 001 GAAACTGACC CTGGCAGGAT GTGAAGAGCG AAATGCGCAG GGCATGTGTT CGTTGGCAGG 
3 061 TTTTACGCAA ATCGTGAATG AAGCACGCAT ACCCGCTTGC AGTTTGTAAG GTATAAGGCA 
3121 GTTATTGGTG CCCTTAAACG CCTGGTGCTA CGCCTGAATA AGTGATAATA AGCGGATGAA 
3181 TGGCAGAAAT TCGCCGGATC TTTGTGAAGG AACCTTACTT CTGTGGTGTG ACATAATTGG 
3241 ACAAACTACC TACAGAGATT TAAAGCTCTA AGGTAAATAT AAAATTTTTA AGTGTATAAT 
3301 GTGTTAAACT ACTGATTCTA ATTGTTTGTG TATTTTAGAT TCCAACCTAT GGAACTGATG 
3 361 AATGGGAGCA GTGGTGGAAT GCCTTTJ^TG AGGAAAACCT GTTTTGCTCA GAAGAAATGC 
3421 CATCTAGTGA TGATGAGGCT ACTGCTGACT CTCAACATTC TACTCCTCCA AAAAAGAAGA 
3481 GAAAGGTAGA AGACCCCAAG GACTTTCCTT CAGAATTGCT AAGTTTTTTG AGTCATGCTG 
3541 TGTTTAGTAA TAGAACTCTT GCTTGCTTTG CTATTTACAC CACAAAGGAA AAAGCTGCAC 
3601 TGCTATACAA GAAAATTATG GAAAAATATT CTGTAACCTT TATAAGTAGG CATAACAGTT 
3 661 ATAATCATAA CATACTGTTT TTTCTTACTC CACACAGGCA TAGAGTGTCT GCTATTAATA 
3721 ACTATGCTCA AAAATTGTGT ACCTTTAGCT TTTTAATTTG TAAAGGGGTT AATAAGGAAT 
3 7 81 ATTTGATGTA TAGTGCCTTG ACTAGAGATC ATAATCAGCC ATACCACATT TGTAGAGGTT 
3 841 TTACTTGCTT TAAAAAACCT CCCACACCTC CCCCTGAACC TGAAACATAA AATGAATGCA 
3 901 ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC 

3 961 ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCAAACTC 

4 021 ATCAATGTAT CTTATCATGT CTGGATCGAT CCCCGGGTAC CGAGCTCGAA TTCGTAATCA 
4 081 TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA CAACATACGA 
4141 GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG TGAGCTAACT CACATTAATT 
42 01 GCGTTGCGCT CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT GCATTAATGA 
42 61 ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GCTCTTCCGC TTCCTCGCTC 
4321 ACTGACTCGC TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG 
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Figure 18 (continued): 

4381 GTAATACGGT TATCCACAGA ATCAGGGGAT 
4441 CAGCAAAAGG CCAGGAACCG TAAAAAGGCC 
4501 CCCCCTGACG AGCATCACAA AAATCGACGC 
4561 CTATAAAGAT ACCAGGCGTT TCCCCCTGGA 
4621 CTGCCGCTTA CCGGATACCT GTCCGCCTTT 
4 681 TGCTCACGCT GTAGGTATCT CAGTTCGGTG 
4741 CACGAACCCC CCGTTCAGCC CGACCGCTGC 
4801 AACCCGGTAA GACACGACTT ATCGCCACTG 
4861 GCGAGGTATG TAGGCGGTGC TACAGAGTTC 
4921 AGAAGGACAG TATTTGGTAT CTGCGCTCTG 

4 981 GGTAGCTCTT GATCCGGCAA ACAAACCACC 
5041 CAGCAGATTA CGCGCAGAAA AAAAGGATCT 
5101 TCTGACGCTC AGTGGAACGA AAACTCACGT 
5161 AGGATCTTCA CCTAGATCCT TTTAAATTAA 
5221 TATGAGTAAA CTTGGTCTGA CAGTTACCAA 
5261 ATCTGTCTAT TTCGTTCATC CATAGTTGCC 
5341 CGGGAGGGCT TACCATCTGG CCCCAGTGCT 
54 01 GCTCCAGATT TATCAGCAAT AAACCAGCCA 
5461 GCAACTTTAT CCGCCTCCAT CCAGTCTATT 
5521 TCGCCAGTTA ATAGTTTGCG CAACGTTGTT 
5581 TCGTCGTTTG GTATGGCTTC ATTCAGCTCC 

5 641 TCCCCCATGT TGTGCAAAAA AGCGGTTAGC 
5701 AAGTTGGCCG CAGTGTTATC ACTCATGGTT 
5761 ATGCCATCCG TAAGATGCTT TTCTGTGACT 
5821 TAGTGTATGC GGCGACCGAG TTGCTCTTGC 
5881 CATAGCAGAA CTTTAAAAGT GCTCATCATT 
5941 AGGATCTTAC CGCTGTTGAG ATCCAGTTCX3 
6001 TCAGCATCTT TTACTTTCAC CAGCGTTTCT 
6061 GCAAAAAAGG GAATAAGGGC GACACGGAAA 
6121 TATTATTGAA GCATTTATCA GGGTTATTGT 
6181 TAGAAAAATA AACAAATAGG GGTTCCGCGC 
6241 TAAGAAACCA TTATTATCAT GACATTAACC 
63 01 CGTCTCGCGC GTTTCGGTGA TGACGGTGAA 
6361 GTCACAGCTT GTCTGTAAGC GGATGCCGGG 
6421 GGTGTTGGCG GGTGTCGGGG CTGGCTTAAC 
6481 GTGCACCATA TGCGGTGTGA AATACCGCAC 
6541 CGCCATTCGC CATTCAGGCT GCGCAACTGT 
6601 CTATTACGCC AGCTGGCGAA AGGGGGATGT 
6661 GGGTTTTCCC AGTCACGACG TTGTAAAACG 



AACGCAGGAA AGAACATGTG AGCAAAAGGC 
GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC 
TCAAGTCAGA GGTGGCGAAA CCCGACAGGA 
AGCTCCCTCG TGCGCTCTCC TGTTCCGACC 
CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA 
TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG 
GCCTTATCCG GTAACTATCG . TCTTGAGTCC 
GCAGCAGCCA CTGGTAACAG GATTAGCAGA 
TTGAAGTGGT GGCCTAACTA CGGCTACACT 
CTGAAGCCAG TTACCTTCGG AAAAAGAGTT 
GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG 
CAAGAAGATC CTTTGATCTT TTCTACGGGG 
TAAGGGATTT TGGTCATGAG ATTAXCAAAA 
AAATGAAGTT TTAAATCAAT CTAAAGTATA 
TGCTTAATCA GTGAGGCACC TATCTCAGCG 
TGACTCCCCG TCGTGTAGAT AACTACGATA 
GCAATGATAC CGCGAGACCC ACGCTCACCG 
GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT 
AATTGTTGCC GGGAAGCTAG AGTAAGTAGT 
GCCATTGCTA CAGGCATCGT GGTGTCACGC 
GGTTCCCAAC GATCAAGGCG AGTTACATGA 
TCCTTCGGTC CTCCGATCGT TGTCAGAAGT 
ATGGCAGCAC TGCATAATTC TCTTACTGTC 
GGTGAGTACT CAACCAAGTC ATTCTGAGAA 
CCGGCGTCAA TACGGGATAA TACCGCGCCA 
GGAAAACGTT CTTCGGGGCG AAAACTCTCA 
ATGTAACCCA CTCGTGCACC CAACTGATCT 
GGGTGAGCAA AAACAGGAAG GCAAAATGCc' 
TGTTGAATAC TCATACTCTT CCTTTTTCAA 
CTCATGAGCG GATACATATT TGAATGTATT 
ACATTTCCCC GAAAAGTGCC ACCTQACGTC 
TATAAAAATA GGCGTATCAC GAGGCCCTTT 
AACCTCTGAC ACATGCAGCT CCCGGAGACG 
AGCAGACAAG CCCGTCAGGG CGCGTCAGCG 
TATGCGGCAT CAGAGCAGAT TGTACTGAGA 
AGATGCGTAA GGAGAAAATA CCGCATCAGG 
TGGGAAGGGC GATCGGTGCG GGCCTCTTCG 
GCTGCAAGGC GATTAAGTTG GGTAACGCCA 
ACGGCCAGTG CCAAGCTT 
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Figure 19: Nucleic acid sequence of the known Reprne nt of the RlS/appa+intron transgCTe 
used for the generation of transgenic mice fSEO ID NO: 3V 



LOCUS R15/appa 4060 bp DNA SYN 15-APR-2000 

DEFINITION R15/appa transgene without vector 

ACCESSION R15/appa 

REFERENCE 1 (bases 1 to 4060) 

SOURCE synthetic construct. 

ORGANTISM synthetic construct 

artificial sequence . 
salivary proline -rich protein, acid glucose -1 -phosphatase; appA 
gene; periplasmic phosphoanhydride phosphohydrolase; artificial 
sequence; 

Golovan, Forsberg, C.W» , Phillips, J. 

Unpubl i shed . 



KEYWORDS 



AUTHORS 
JOURNAL 



DEFINITION Rat salivary proline-rich protein (RP15) gene. 
ACCESSION M64793 M36414 

M64793 ,1 GI: 2 06711 
Rat (Sprague-Dawley) liver DNA. 
ORGANISM Rattus norvegicus 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; 



VERSION 
SOURCE 



Mammalia; 

Rattus . 

REFERENCE 
AUTHORS 
TITLE 

encoding 

JOURNAL 
MEDLINE 
FEATURES 

source 



Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; 

1 (bases 1 to 1748) 
Lin,H.H. and Ann,D.K. 

Molecular characterization of rat multigene family 

proline-rich proteins 
Genomics 10, 102-113 (1991) 
91257817 

Location/ Qualifiers 
1..1748 

/organism^ "Rattus norvegicus" 
/strains " Sp r ague -Dawley" 
/ db_xref « " t cocon : 1 0 1 16 " 
/ tissue_typea= " liver" 

/tissue_lib="cosmid genomic library" 
1802-1810 

/functions" consensus sequence for initiation in 

higher eukaryotes ** 



mi so feature 



FEATURES Location/ Qualifiers 

DEFINITION E. coli periplasmic phosphoanhydride phosphohydrolase (appA) 
gene. 



ACCESSION 
VERSION 
SOURCE 
ORGANISM 



M58708 L03370 L03371 L03372 L03373 L03374 L0337S 
M58708 . 1 GI: 145283 

Escherichia coli DNA. 
Escherichia coli 



Bacteria; Proteobacteria; gamma subdivision; 
Enterobac teriaceae ; 
Escherichia. 

REFERENCE 1 (bases 1811. .3109) 

AUTHORS Dassa,J., March, C. and Boquet,P.L. 
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Figure 19 (continued): 



TITLE 



JOURNAL 
MEDLINE 



The complete nucleotide sequence of the Escherichia coli 
gene appA reveals significant homology between pH 2.5 
acid phosphatase and glucose -1-phosphatase 

J. Bacterid. 172 (9), 5497-5500 (1990) 

90368616 



FEATURES 

Source 



sig_peptide 
/gene="appA 



II 



CDS 



Location/ Qualifiers 

1811. . 3109 
/organisms "Escherichia coli" 
/db_xref = " taxon : 562 " 
1811. . 1876 

1811. .3109 

/ gene= " appA" 

/standard_names="acid phosphatase /phytase 
/ trans l_t abl e =11 

/ produc t s= " per ip 1 a smic pho sphoanhydri de 
phosphohydlrolase '* 
/protein_id="AAA72086 .1" 
/db xref="GI: 145285" 



/translation^ "MKAILIPFLSLLIPLTPQSAFAQSEPELKLESWIVSRHGVRAP 
TKATQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYLGHYQRQRLVADGLLAKKGCPQS 
GQVAIIADVDERTRKTGEAFAAGLAPDOVITVHTQADTSSPDPLFNPLKTGVCQLDNA 
NVTDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSI^LCLKREKQDESCSLTQALPS 
ELKVSADWSLTGAVSLASMLTEIFLLQQAQGMPEPGWGRITDSHQWNTLLSLHNAQF 
YLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFIAGHDTNLANLGG 



ALELNWTLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSLVFQTLQQMRDKTPLSLNT 

PPGEVKLTLAGCEERNAQGMCSLAGFTQI WSARIPACSL " 

mat__peptide 1877 3106 

/gene= " appA" 

/product =s"periplasmic phosphoanhydride 
phosphohydrolase " 



mutation 



mutation 



mutation 



replace (1817 . . 1819, "gcg changed to gcc") 
/ gene= " appA" 

/ s t anda r d_name " A3 mut ant " 

/note=" created fay site directed mutagenesis" 
/phenotype=" silent mutation" 
replace (3092. .3094, " ccg changed to ccc") 
/gene= "appA" 

/standard_name=" P428 mutant" 

/note=" created by site directed mutagenesis" 

/phenotype^s" silent mutation " 

replace (3095 3097 , " gcg changed to get") 

/ gene = " appA " 

/standard^name*" A429 mutant" 

/no te= "created by site directed mutagenesis" 

/phenotype=" silent mutation " 
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Figure 19 (continuedV. 

SV40 t intron 
po lyA_s i gnal 



3197 . -3810 
/note=:^"SV40 signals" 
3807 . -4047 
/note="SV40 signals" 



BASE COUNT 1257 a 814 c 843 g 1146 t 

ORIGIN 

1 GGATCCCCTT TGCTATGTAG TTTTTAATGG AAATTACAAC CCATAGTGTG TTGATAAATA 
61 GAGAGTCCTG TTTGGTTTAA GCAACCTCTG TTTCTCATAA ACTCCATAAA AACAGGAATA 
121 CTCTTTGTTT CTAGCATAAC CAAAAGATTT AGTGAATTGA AAACAATGTT CCCTTAGAGT 
181 ATAGGTCTAA TAACCCCGAA AATATTACCA TGATACTGAG CATTTGTAAG TATCTCATAG 
241 CATGTAGTAT CCATAGTCCA TCAATGAGAG AGACATTTAA CATGATTTTC ATTAATCAGG 
301 TGGAAAAGAC ATGACAACAT TCACAGGCAC TGCACAGAAC ATAGTGGTCC ACCTTGCACA 
361 TATTTCACTA AACTAGGTTT ATCTATTTTG TTGCTTTCTC TAACATCTCT GCAATGAAGC 
421 AGGTCAACAG TGCCACATAT CCTTTACTTA ACCTAAGGAA CACAATU^AAT TTTCTACATA 
4 81 TATCCTGGTT AGAGAGTGCT TAAAATAAGT TTTCCAAQAA TGGAAAAGAA ATGTTCTGAC 
541 TTAACAATTA AGACAGTATT TATTTAAAGC AAGAAATATG AGGCACACAA GAAAATATTT 
601 TGGGAAGAAA CCATTTGGTG AACAATATTT CAAATAAAAA TAGACAAACA TAGTTAATTG 
661 TAAAACATAT GTTTGACCAG CCCTTCTTTT CAATAGGCTT J\ATGTGAATA AAATGTTAAA 
721 GATTCTCTTT GGGTGGCTGC AAATTGTCCA CGAATAAGAC AAAATATAAA AATAAGGACT 
781 GAGTCTCACA AAATGAAAAG GAAATATATT CAGAAAGAGA ATCTTGAGAG AATGTGTTGT 
841 CACAAATTAA AGAAAACCTG TGGTGAATGA CATCCTGAGG CCTGAGCTAT TACTQACATT 
901 TAAGATAAAG GTAACTGTAT ACATTTGTCC CATTGAGGGG ACAAGAAAGC TGCTCTCATG 
961 TTCAGCTCTA TAATTCTTGC CTTAAACAAC TTA7ATAGAA TGATTTAAAA TATGGAGCTG 
1021 TCCATGGACC TTTGAAATAT AAAATAGTCA AGCAACTTAT CAAGGAATTA CAGATTCCTT 
1081 GATACTAACA CAGGTAAATC CCACACGTGT TTTGAGACTA CATTTGCTGG GATTTTATTG 
1141 ATGTAATAGG TCACATGTTT TTCGGGCCAA TGTTGCTGTT ATTCGGTTAC TTCAAGAGAA 
12 01 TAGTGGCAAC TGATGCTATG TATTCTAGGG GTTTGAAGTG ATGTTTCATG ATTGAAATTT 

12 61 GTAAAAGAAT AACATCATCA TTCTTAACAA TAGAACATAT AAAGTCACAC AGAAGTGACA 
1321 GTGTTTAAGC TGTACTATTG ATCAAAGAAA TTTATTACCT TCAGTTTCAA. TGGAAATAAT 

13 81 TACTGATAAT ACAAACATGT GTGAACACAC ACTAATCCTA TCCAAATGCA CAGTGATACA 
1441 CAGAAAATAT TAGCAAGTAG AATGCAATAT TTATATAACG ATTGTATTTA TCAATCAATT 
1501 GTATGTATCA ATATATGGGC TATTTTCTTA CACATGATTT TATTCAAATT TACTCTAATC 
1561 ATTGTTGAAC CATTTAGAAA AGGCATACTG GCAACTTTTC CTTACCTCAT CCAGCTGGGC 
1621 AAAAGTCCCA GTGTGGAGTA AAGGATGCAA GATTTCCTGC TCTGTTAAGT ATAAAATAAT 
1681 AGTATGAATT CJ^AAGGTGCC ATTCTTCTGC TTCTAGTTAT AAAGGCAGTG CTTGCTTCTT 
1741 CCAGCACAGA TCTGGATCTC GAGGAGCTTG GCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
1801 AGCCGCCACC ATGAAAGCCA TCTTAATCCC ATTTTTATCT CTTCTGATTC CGTTAACCCC 
1861 GCAATCTGCA TTCGCTCAGA GTGAGCCGGA GCTGAAGCTG GAAAGTGTGG TGATTGTCAG 
1921 TCGTCATGGT GTGCGTGCTC CAACCAAGGC CACGCAACTG ATGCAGGATG TCACCCCAGA 
1981 CGCATGGCCA ACCTGGCCGG TAAAACTGGG TTGGCTGACA CCGCGCGGTG GTGAGCTAAT 
2041 CGCCTATCTC GGACATTACC AACGCCAGCG TCTGGTAGCC GACGGATTGC TGGCGAAAAA 
2101 GGGCTGCCCG CAGTCTGGTC AGGTCGCGAT TATTGCTGAT GTCGACGAGC GTACCCGTAA 
2161 AACAGGCGAA GCCTTCGCCG CCGGGCTGGC ACCTGACTGT GCAATAACCG TACATACCCA 
2221 GGCAGATACG TCCAGTCCCG ATCCGTTATT TAATCCTCTA AAAACTGGCG TTTGCCAACT 
22 81 GGATAACGCG AACGTGACTG ACGCGATCCT CAGCAGGGCA GGAGGGTCAA TTGCTGACTT 
2341 TACCGGGCAT CGGCAAACGG CGTTTCGCGA ACTGGAACGG GTGCTTAATT TTCCGCAATC 
24 01 AAACTTGTGC CTTAAACGTG AGAAACAGGA CGAAAGCTGT TCATTAACGC AGGCATTACC 
2461 ATCGGAACTC AAGGTGAGCG CCGACAATGT CTCATTAACC GGTGCGGTAA GCCTCGCATC 
2521 AATGCTGACG GAGATATTTC TCCTGCAACA AGCACAGGGA ATGCCGGAGC CGGGGTGGGG 
2581 AAGGATCACC GATTCACACC AGTGGAACAC CTTGCTAAGT TTGCATAACG CGCAATTTTA 
2641 TTTGCTACAA CGCACGCCAG AGGTTGCCCG CAGCCGCGCC ACCCCGTTAT TAGATTTGAT 
27 01 CAAGACAGCG TTGACGCCCC ATCCACCGCA AAAACAGGCG TATGGTGTGA CATTACCCAC 
2761 TTCAGTGCTG TTTATCGCCG GACACGATAC TAATCTGGCA AATCTCGGCG GCGCACTGGA 
2 821 GCTCAACTGG ACGCTTCCCG GTGAGCCGGA TAACACGCCG CCAGGTGGTG AACTGGTGTT 
2 8 81 TGAACGCTGG CGTCGGCTAA GCGATAACAG CCAGTGGATT CAGGTTTCGC TGGTCTTCCA 
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Figure 19 fcontinuedV 

2941 GACTTTACAG CAGATGCGTG ATAAAACGCC 
3001 GAAACTGACC CTGGCAGGAT GTGAAGAGCG 
3061 TTTTACGCAA ATCGTGAATG AAGCACGCAT 
3121 GTTATTGGTG CCCTTAAACG CCTGGTGCTA 
3181 TGGCAGAAAT TCGCCGGATC TTTGTGAAGG 
3241 ACAAACTACC TACAGAGATT TAAAGCTCTA 
3301 GTGTTAAACT ACTGATTCTA ATTGTTTGTG 
33 61 AATGGGAGCA GTGGTGGAAT GCCTTTAATG 
3421 CATCTAGTGA TGATGAGGCT ACTGCTGACT 
3481 GAAAGGTAGA AGACCCCAAG GACTTTCCTT 
3541 TGTTTAGTAA TAGAACTCTT GCTTGCTTTG 
3601 TGCTATACAA GAAAATTATG GAAAAATATT 
3661 ATAATCATAA CATACTGTTT TTTCTTACTC 
3721 ACTATGCTCA AAAATTGT6T ACCTTTAGCT 
3781 ATTTGATGTA TAGTGCCTTG ACTAGAGATC 
3841 TTACTTGCTT TAAAT^AACCT CCCACACCTC 
3901 ATTGTTGTTG TTAACTTGTT TATTGCAGCT 
3961 ACAAATTTCA CAAATAAAGC ATTTTTTTCA 
4021 ATCAATGTAT CTTATCATGT CTGGATCGAT 

// 



GCTGTCATTA AATACGCCGC CCGGAGAGGT 
AAATGCGCAG GGCATGTGTT CGTTGGCAGG 
ACCCGCTTGC AGTTTGTAAG GTATAAGGCA 
CGCCTGAATA AGTGATAATA AGCGGATGAA 
AACCTTACTT CTGTGGTGTG ACATAATTGG 
AGGTAAATAT AAAATTTTTA AGTGTATAAT 
TATTTTAGAT TCCAACCTAT GGAACTGATG 
AGGATU^CCT GTTTTGCTCA GAAGAAATGC 
CTCAACATTC TACTCCTCCA AAAAAGAAGA 
CAGAATTGCT AAGTTTTTTG AGTCATGCTG 
CTATTTACAC CACAAAGGAA AAAGCTGCAC 
CTGTAACCTT TATAAGTAGG CATAACAGTT 
CACACAGGCA TAGAGTGTCT GCTATIAATA 
TTTTAATTTG TAAAGGGGTT AATAAGGAAT 
ATAATCAGCC ATACCACATT TGTAGAGGTT 
CCCCTGAACC TGAAACATAA AATGAATGCA 
TATAATGGTT ACAAATAAAG CAATAGCATC 
CTGCATTCTA GTTGTGGTTT GTCCAAACTC 
CCCCGGGTAC 
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ire 20: Nucleic acid sequence of the known segment of the R15/appa plasmid fincluding 

the vector sequences of pBLCATS CSEO ID NO:4y 



LOCUS 

DEFINITION 
ACCESSION 
REFERENCE 
SOURCE 



15-APR-2000 



KEYWORDS 



AUTHORS 
JOURNAL 



R15/appa 6116 bp DNA SYN 

R15/appa transgene with vector 
R15/appa 

1 (bases 1 to 6116) 
synthetic construct. 
ORGANISM synthetic construct 

artificial sequence, 
salivary proline-rich protein, acid glucose -1 -phosphatase; appA 
gene; periplasmic phosphoanhydride phosphohydrolase; artificial 
sec[uence ; 

Golovan, S., Forsberg, C.W-, Phillips, J, 
Unpublished. 



DEFINITION 
ACCESSION 
VERSION 
SOURCE 

ORGANISM 

Mammalia; 

Rattus . 

REFERENCE 

AUTHORS 

TITLE 
encoding 

JOURNAL 
MEDLINE 
FEATURES 

source 



misc feature 



Rat salivary proline -rich protein (RP15) gene. 
M64793 M36414 
M64793.1 GI: 206711 

Rat (Sprague-Dawley) liver DNA. 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; 

1 (bases 1 to 1748) 
Lin,H.H. and Axin,I> .K, 

Molecular characterization of rat multigene family 

proline -rich proteins 
Genomics 10, 102-113 (1991) 
91257817 

Location/Qualifiers 
1. .1748 

/organisni="Rattus norvegicus" 
/s trains" Sprague-Dawley" 
/db_xref ="taxon : 10116 " 
/ 1 i s sue_t ype s= " 1 i ve r " 

/tissue^libs^cosmid genomic library" 

1802-1810 

/function*" consensus sequence for initiation in 

higher eukaryotes 



FEATURES Location/ Qualifiers 

DEFINITION E. coli periplasmic phosphoanhydride phosphohydrolase (appA) 
gene, 



ACCESSION 
VERSION 
SOURCE 
ORGANISM 



M58708 L03370 L03371 L03372 L03373 L03374 L03375 
MS8708.1 GI:145283 
Escherichia coli DNA. 
Escherichia coli 



Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae ; 
Escherichia . 



PXFERENCS 1 
AUTHORS 
TITLE 



JOUPJJ.2LL 



(bases 18I1..3109) 

Dassa, J. , Marck,C. and Boquet,P.L. 

The complete nucleotide sequence of the Escherichia coli gene appA 
reveals significant homology between pH 2.5 acid phosphatase 
and glucose- l-phosphatas€ 
Bacterid. 172' {97, 5497- 



J. 



5500 {1990) 
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Figgre 20 (continued) : 



HEDIiINE 503 6 8616 



FEATURES 

Source 



sig_peptide 
/genes "appA" 
CDS 



Locat ion/Qual if iers 

1811 . .3109 
/organism- "Escherichia coli" 
/db_xref ="taxonr562" 

1811. . 1876 

1811. .3109 

/gene= " appA" 
/ standard_natne= " acid phosphatase/phytase " 
/trans l_^table= 11 

/product=''periplasmic phosphoanhydride phosphohydrolase' 
/protein_id="AAA72086 .1" 
/db xref="GI: 145285" 



/translation««MKAILIPFLSIiLIPLTPOSAFAQSEPELKLESWIVSRHGVRAP 
TKATQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYLGHyQRQRLVADGU^AKKGCPQS 
GQVAIIADVDERTRKTGEAFAAGIAPDCAITVHTQADTSSPDPIiFNPLKTGVCQLDNA 
]mnDAIIiSRAGGSIADFTGHRQTAFRELERVIiNFPQSNLCLKREKQDESCSLTOAL.PS 
ELKVSADNVSLTGAVSIA^MIjTEIFLLQQAQGMPEPGWGRITDSHQWNTLLSLHNAQF 
YliLQRTPEVARSRATPLIJDLIKTALTPHPPQKQAYGVTLPTSVLFIAGHDTNLANliGG 



ALELNWTLPGQPDNTPPGGErjVFERWRRLSDNSQWIQVSLVFOTL0QMRDKTPI.SIiNT 

PPGEVKLTLAGCEERNAQQ^CSLAGFTQIVNEARIPACSL" 



mat_peptide 



mutation 



mutation 



mutation 



1877 3106 
/gene= " appA" 

/products" per ipl a smic phosphoanhydride phosphohydrolase" 

replace (1817 - , 1819, "gcg changed to gcc*) 
/gene="appA" 

/ s tandard_naine- "A3 mutant " 

/noce= "created by site directed mutagenesis" 
/phenotype= "silent mutation" 
replace (3 092 3094 , " ccg changed to ccc**) 
/gene=" appA" 

/standard_name= " P42 8 mutant" 

/notes" created by site directed mutagenesis" 

/phenotype=" silent mutation " 

replace (3095 . ,3097, gcg changed to get") 

/genes "appA" 

/standard_names=" A429 mutant" 

/not €= "created by site directed mutagenesis" 

/phenotype=" silent mutation " 



DEFINITION Plasmid pBLCAT3 (bases 3109 to 6116) 



ACCESSION 

VERSION 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOUPJJAL 



X64409 

X64409.1 GI:58163 
synthetic construct . 
synthetic construct 
artificial sequence . 
1 (bases 3109 to 6116) 
l4Uckow,B.H.R. 
Direct Submission 

Submitted { 06 -FEB- 1992) B.H.R. Luckow, German Cancer Res 
Center, Im Neuenheiner Feld 280, W-6900 Heidelberg, FRG 
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Figure 20 (continued): 



REFERENCE 
AUTHORS 
TITLE 

for 

regulatory 

JOURNAL 
MEDLINE 
COMMENT 
experiments 



FEATURES 

source 



polyA^signal 



CDS 



2 (bases 3109 to €116) 
Luckow,B. and Schutz^G. 

CAT constructions with multiple unique restriction sites 
the functional analysis of eukaryotic promoters and 
elements 

Nucleic Acids Res. 15 (13), 5490 (1987) 
87260024 

Promoterless CAT vector for transient transfection 

with eukaryotic cells. Allows the analysis of foreign 
promoters and enhancers - 

Location/Qualifiers 

3109 to 6116 

/organism= " synthetic construct" 
/ db__xref = " taxon : 3 2 63 0 " 
3262. ,3457 

/note="SV40 signals" 

complement (4 654 . .5514) 
/ codon_s t ar t = 1 
/ trans l_tab 1 e = 1 1 
/gene="An^" 

/products "beta - lactamase " 
/protein_id="CAA45753 .1" 
/db xref="GI:S8165" 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 



1724 a 1386 c 1407 g 1599 t 



GGATCCCCTT TGCTATGTAG 
GAGAGTCCTG TTTGGTTTAA 
CTCTTTGTTT CTAGCATAAC 
ATAGGTCTAA TAACCCCGAA 
CATGTAGTAT CCATAGTCCA 
TGGAAAAGAC ATGACAACAT 
TATTTCACTA AACTAGGTTT 
AGGTCAACAG TGCCACATAT 
TATCCTGGTT AGAGAGTGCT 
TTAACAATTA AGACAGTATT 
TGGGAAGAAA CCATTTGGTG 
TAAAACATAT GTTTGACCAG 
GATTCTCTTT GGGTGGCTGC 
GAGTCTCACA AAATGAAAAG 
CACAAATTAA AGAAAACCTG 
901 TAAGATAAAG GTAACTGTAT 
961 TTCAGCTCTA TAATTCTTGC 
1021 TCCATGGACC TTTGAAATAT 
1081 GATACTAACA CAGGTAAATC 
1141 ATGTAATAGG TCACATGTTT 
1201 TAGTGGCAAC TGATGCTATG 
1261 GTAAAAGAAT AACATCATCA 
13 21 GTGTTTAAGC TGTACTATTG 
13 81 TACTGATAAT ACAAACATGT 
1441 CAGAAAATAT TAGCAAGTAG 
1501 GTATGTATCA ATATATGGGC 
15 61 ATTGTTGAZ^C CATTTAGAAA 
1621 AAAAGTCCCA GTGTGGAGTA 



TTTTTAATGG 

GCAACCTCTG 

CAAAAGATTT 

AATATTACCA 

TCAATGAGAG 

TCACAGGCAC 

ATCTATTTTG 

CCTTTACTTA 

TAAAATAAGT 

TATTTAAAGC 

AACAATATTT 

CCCTTCTTTT 

AAATTGTCCA 

GAAATATATT 

TGGTGAATGA 

ACATTTGTCC 

CTTAAACAAC 

AAAATAGTCA 

CCACACGTGT 

TTCGGGCCAA 

TATTCTAGGG 

TTCTTAACAA 

ATCAAAGAAA 

GTGAACACAC 

AATGCAATAT 

TATTTTCTTA 

AGGCATACTG 

A^^GGATGCAA 
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AAATTACAAC 

TTTCTCATAA 

AGTGAATTGA 

TGATACTGAG 

AGACATTTAA 

TGCACAGAAC 

TTGCTTTCTC 

ACCTAAGGAA 

TTTCCAAGAA 

AAGAAATATG 

CAAATAAAAA 

CAATAGGCTT 

CGAATAAGAC 

CAGAAAGAGA 

CATCCTGAGG 

CATTGAGGGG 

TTAAATAGAA 

AGCAACTTAT 

TTTGAGACTA 

TGTTGCTGTT 

GTTTGAAGTG 

TAGAACATAT 

TTTATTACCT 

ACTAATCCTA 

TTATATAACG 

CACATGATTT 

GCAACTTTTC 

GATTTCCTGC 



CCATAGTGTG 

ACTCCATAAA 

AAACAATGTT 

CATTTGTAAG 

CATGATTTTC 

ATAGTGGTCC 

TAACATCTCT 

CACAAAAAAT 

TGGAAAAGAA 

AGGCACACAA 

TAGACAAACA 

AATGTGAATA 

AAAATATAAA 

ATCTTGAGAG 

CCTGAGCTAT 

ACAAGAAAGC 

TGATTTAAAA 

CAAGGAATTA 

CATTTGCTGG 

ATTCGGTTAC 

ATGTTTCATG 

AAAGTCACAC 

TCAGTTTCAA 

TCCAAATGCA 

ATTGTATTTA 

TATTCAAATT 

CTTACCTCAT 

TCTGTTAAGT 



TTGATAAATA 

AACAGGAATA 

CCCTTAGAGT 

TATCTCATAG 

ATTAATCAGG 

ACCTTGCACA 

GCAATGAAGC 

TTTCTACATA 

ATGTTCTGAC 

GAAAATATTT 

TAGTTAATTG 

AAATGTTAAA 

AATAAGGACT 

AATGTGTTGT 

TACTGACATT 

TGCTCTCATG 

TATGGAGCTG 

CAGATTCCTT 

GATTTTATTG 

TTCAAGAGAA 

ATTGAAATTT 

AGAAGTGACA 

TGGAAATAAT 

CAGTGATACA 

TCAATCAATT 

TACTCTAATC 

CCAGCTGGGC 

ATAAAATA?«-T 



BNSOOCID: <WO_0064247A1 I > 
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Figure 20 (continued): 

1681 AGTATGAATT 
1741 CCAGCACAGA 
1801 AGCCGCCACC 
1861 GCAATCTGCA 
1921 TCGTCATGGT 
1981 CGCATGGCCA 
2041 CGCCTATCTC 
2101 GGGCTGCCCG 
2161 AACAGGCGAA 
2221 GGCAGATACG 
22 81 GGATAACGCG 
2341 TACCGGGCAT 
2401 AAACTTGTGC 
2461 ATCGGAACTC 
2521 AATGCTGACG 
25 61 AAGGATCACC 
2 641 TTTGCTACAA 
2 7 01 CAAGACAGCG 
2761 TTCAGTGCTG 
2 821 GCTCAACTGG 
2 881 TGAACGCTGG 

2 941 GACTTTACAG 

3 0 01 GAAACTGACC 
3061 TTTTACGCAA 
3121 GTTATTGGTG 
3181 TGGCAGAAAT 
3241 ACAAACTACC 
3301 AATGAATGCA 
3361 CAATAGCATC 
3421 GTCCAAACTC 
3481 TTCGTAATCA 
3 541 CAACATACGA 
3601 CACATTAATT 
3661 GCATTAATGA 
3721 TTCCTCGCTC 
3781 CTCAAAGGCG 
3 841 AGCAAAAGGC 

3 901 TAGGCTCCGC 
3961 CCCGACAGGA 
4021 TGTTCCGACC 

4 081 GCTTTCTCAA 
4141 GGGCTGTGTG 

42 01 TCTTGAGTCC 
4261 GATTAGCAGA 
4321 CGGCTACACT 

43 81 AAAAAGAGTT 
4441 TGTTTGCAAG 
4501 TTCTACGGGG 
45 61 ATTATCAAAA 
4621 CTAAAGTATA 
4631 TATCTCAGCG 
4741 AACTACGATA 
4 801 ACGCTCACCG 
4861 AAGTGGTCCT 
4 921 AGTAAGTAGT 
4 951 GGTGTCACGC 
504 1 AGTTACATGA 
5101 TGTCAGAAGT 



BNSDOCiD- <WO_0064247A1 I > 



PCT/CAOO/00430 



CAAAGGTGCC ATTCTTCTGC TTCTAGTTAT AAAGGCAGTG CTTGCTTCTT 
TCTGGATCTC GAGGAGCTTG GCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
ATGAAAGCCA TCTTAATCCC ATTTTTATCT CTTCTGATTC CGTTAACCCC 
TTCGCTCAGA GTGAGCCGGA GCTGAAGCTG GAAAGTGTGG TGATTGTCAG 
GTGCGTGCTC CAACCAAGGC CACGCAACTG ATGCAGGATG TCACCCCAGA 
ACCTGGCCGG TAAAACTGGG TTGGCTGACA CCGCGCGGTG GTGAGCTAAT 
GGACATTACC AACGCCAGCG TCTGGTAGCC GACGGATTGC TGGCGAAAAA 
CAGTCTGGTC AGGTCGCGAT TATTGCTGAT GTCGACGAGC GTACCCGTAA 
GCCTTCGCCG CCGGGCTGGC ACCTGACTGT GCAATAACCG TACATACCCA 
TCCAGTCCCG ATCCGTTATT TAATCCTCTA AAAACTGGCG TTTGCCAACT 
AACGTGACTG ACGCGATCCT CAGCAGGGCA GGAGGGTCAA TTGCTGACTT 
CGGCAAACGG CGTTTCGCGA ACTGGAACGG GTGCTTAATT TTCCGCAATC 
CTTAAACGTG AGAAACAGGA CGAAAGCTGT TCATTAACGC AGGCATTACC 
AAGGTGAGCG CCGACTU^TGT CTCATTAACC GGTGCGGTAA GCCTCGCATC 
GAGATATTTC TCCTGCAACA AGCACAGGGA ATGCCGGAGC CGGGGTGGGG 
GATTCACACC AGTGGAACAC CTTGCTAAGT TTGCATAACG CGCAATTTTA 
CGCACGCCAG AGGTTGCCCG CAGCCGCGCC ACCCCGTTAT TAGATTTGAT 
TTGACGCCCC ATCCACCGCA AAAACAGGCG TATGGTGTGA CATTACCCAC 
TTTATCGCCG GACACGATAC TAATCTGGCA AATCTCGGCG GCGCACTGGA 
ACGCTTCCCG GTGAGCCGGA TAACACGCCG CCAGGTGGTG AACTGGTGTT 
CGTCGGCTAA GCGATAACAG CCAGTGGATT CAGGTTTCGC TGGTCTTCCA 
CAGATGCGTG ATAAAACGCC GCTGTCATTA AATACGCCGC CCGGAGAGGT 
CTGGCAGGAT GTGAAGAGCG AAATGCGCAG GGCATGTGTT CGTTGGCAGG 
ATCGTGAATQ AAGCACGCAT ACCCGCTTGC AGTTTGTAAG GTATAAGGCA 
CCCTTAAACG CCTGGTGCTA CGCCTGAATA AGTGATAATA AGCGGATGAA 
TCGCCGGATC TTTGTGAAGG AACCTTACTT CTGTGGTGTG ACATAATTGG 
TACAGAGATT TAAAAAACCT CCCACACCTC CCCCTGAACC TGAAACATAA. 
ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG 
ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT 
ATCAATGTAT CTTATCATGT CTGGATCGAT CCCCGGGTAC CGAGCTCGAA 
TGGTCATAGC TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA 
GCCGGAAGCA TAAAGTGTAA AGCCTGGGGT GCCTAATGAG TGAGCTAACT 
GCGTTGCGCT CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT 
ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG CQTATTGGGC GCTCTTCCGC 
ACTGACTCGC TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA 
GTAATACGGT TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG 
CAGCAAAAGG CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA 
CCCCCTGACG AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA 
CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC 
CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC 
TGCTCACGCT GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT 
CACGAACCCC CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG 
AACCCGGTAA GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG 
GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 
AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG 
GGTAGCTCTT GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT 
CAGCAGATTA CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT 
TCTGACGCTC AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG 
AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 
TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA GTGAGGCACC 
ATCTGTCTAT TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT 
CGGGAGGGCT TACCA.TCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC 
GCTCCAGATT TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG 
GCAACTTTAT CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG 
TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT 
TCGTCGTTTG GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG 
TCCCCCATGT TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCGATCGT 
ArtGTTGGCCG CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC 

42/58 

RECTIFIED SHEET (RULE 91) 



wo 00/64247 



PCT/CAOO/00430 



Figure 20 (continued^ 

5161 TCTTACTGTC ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC 
5221 ATTCTGAGAA TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA TACGGGATAA 
52 81 TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG 
5341 AAAACTCTCA AGGATCTTAC CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC 
54 01 CAACTGATCT TCAGCATCTT TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG 
5461 GCAAAATGCC GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT 
5521 CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT 
5581 TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC 
5641 ACCTGACGTC TAAGAAACCA TTATTATCAT GACATTAACC TATAAAAATA GGCGTATCAC 
5701 GAGGCCCTTT CGTCTCGCGC GTTTCGGTGA TGACGGTGAA AACCTCTGAC ACATGCAGCT 
5761 CCCGGAGACG GTCACAGCTT GTCTGTAAGC GGATGCCGGG AGCAGACAAG CCCGTCAGGG 
5821 CGCGTCAGCG GGTGTTGGCG GGTGTCGGGG CTGGCTTAAC TATGCGGCAT CAGAGCAGAT 
58 81 TGTACTGAGA GTGCACCATA TGCGGTGTGA AATACCGCAC AGATGCGTAA GGAGAAAATA 

5 941 CCGCATCAGG CGCCATTCGC CATTCAGGCT GCGCAACTGT TGGGAAGGGC GATCGGTGCG 

6 0 01 GGCCTCTTCG CTATTACGCC AGCTGGCGAA AGGGGGATGT GCTGCAAGGC GATTAAGTTG 
6061 GGTAACGCCA GGGTTTTCCC AC^TCACGACG TTGTAAAACG ACGGCCAGTG CCAAGC 
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Figure 21: Nucleic acid sequence of the known segment of the R15/appa trans gene used for 
the generation of transgenic mice fSEO ID NO:SV 



LOCUS 

DEFINITION 
ACCESSION 
REFERENCE 
SOURCE 



RlS/appa 3470 bp DNA SYN 15-APR-2000 

R15/appa transgene with vector sequences removed. 

RlS/appa 

1 (bases 1 to 3470) 
synthetic construct . 



ORGANISM 



KEYWORDS 



AUTHORS 
JOURNAL 



synthetic construct 
artificial sequence, 
salivary proline-rich protein^ acid glucose-l-phosphatase ; appA 
gene; periplasmic phosphoanhydride phosphohydrolase; artificial 
sequence ; 

Golovan, S., Forsberg, C.W., Phillips, J. 
Unpublished . 



DEFINITION 
ACCESSION 
VERSION 
SOURCE 

ORGANISM 

Mammalia; 

Rattus . 

REFERENCE 
AUTHORS 
TITIiE 

encoding 

JOURNAL 
MEDLINE 
FEATURES 

source 



misc feature 



Rat salivary proline-rich protein (RP15) gene, 

M64793 M36414 

M64793 .1 GI: 20 6711 

Rat (Sprague-Dawley) liver DNA. 

Rattus norvegi cus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; 

1 (bases 1 to 1748) 
Lin,H.H. and Ann,D.K. 

Molecular characterization of rat multigene family 

proline-rich proteins 
Genomics 10, 102-113 (1991) 
91257817 

Location/ Qualifiers 
1. .1748 

/organism- "Rattus norvegi cus" 
/strain^" Sprague-Dawley" 
/db_xref=:"taxon: 10116" 
/tissue_type=" liver" 

/tissue_lib=s"cosmid genomic library" 
1802-1810 

/functions" consensus sequence for initiation in 

higher eukaryotes 



FEATURES Location/Qualifiers 

DEFINITION E. coli periplasmic phosphoanhydride phosphohydrolase (appA) 
gene, 

ACCESSION M58708 L03370 L03371 L03372 L03373 L03374 L03375 
VERSION M58708.1 GI: 145283 

SOURCE Escherichia coli DNA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae; 

Escherichia . 

REFERENCE 1 (bases 1811. .3109) 

AUTHORS Dassa,J., Marck,C. and 3oquet,P.L. 

TITLE The complete nucleocide sequence of the Escherichia coli cene appA 

reveals significant home logy between pH 2.5 acid phosphatase 
and glucose- 1-phosphacase 
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Figure 21 (continued'i: 



JOURNAIi 
MEDLINE 



J. Bacterid. 172 (9), 5497-5500 (1990) 
90368616 



FEATURES 

Source 



sig_pept:ide 



CDS 



Location/Qualifiers 

1811. .3109 
/orgaiiism=" Escherichia coli" 
/db_xref = " taxon: 562 " 

1811. . 1876 
/gene= " appA" 
1811. -3109 
/gene=" appA" 

/standard_name=: "acid phosphatase/phytase" 
/transl_table=ll 

/products "periplasmic phosphoanhydride phosphohydrotase ' 
/ prot e in_i d= " AAA7 2 0 B 6 . 1 " 
/db xref«"GX: 145285" 



/translation^ "MKAILIPFLSLLIPLTPQSAFAQSEPELKLESWIVSRHGVRAP 

TKATQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYLGHYQRQRLVADGLLAKKGCPQS 

GQVAIIADVDSRTRKTGEAFAAGIAPDCAITVHTQADTSSPDPLFNPIiKTGVCQLDNA 

NVTDAILSRAGGSIADFTGHRQTAFRELERVl4NFPQSm*CLKREKQDESCSLTQALPS 

ELKVSADNVSLTGAVSLASMLTEIFLIiQQAQGMPEPGWGRITDSHQWNTIiLSI^ 

YLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFIAGHDTNLANLGG 



ALEIiNWrLPGQPDNTPPGGELVFERWRRI.SDNSQWIOVSljVFQTLQQMRDKTPL»SLNT 

PPGEVKLTLAGCEERNAOGMCSLAGFTQIVNEARIPACSL" 



mat peptide 



mutation 



mutation 



mutation 



1877 3106 
/gene= " appA" 

/products "periplasmic phosphoanhydride piiosphohydrolase " 

replace (1817 . . 1819, "gcg changed to gcc") 
/gene= "appA" 

/standard_name=t "A3 mutant" 

/note= "created by site directed mutagenesis" 

/phenotype=" silent mutation" 

replace (3092 . .3094, " ccg changed to ccc") 

/gene-*'appA" 

/ standard_name~ " P428 mutant" 

/note=" created by site directed mutagenesis" 

/phenotypea" silent mutation " 

replace (3095 3097 , " gcg changed to get") 

/gene=" appA" 

/standard_name= " A429 mutant" 

/note=" created by site directed mutagenesis" 

/phenotypesa " silent mutation " 



polyA_signal 3262. ,3457 

/note=:"SV40 signals" 

BASE COUNT 1065 a 721 c 735 g 949 t 

ORIGIN 

1 GGA.TCCCCTT TGCTATGTAG TTTTTAATGG AAATTACAAC CCATAGTGTG TTGATAAATA 

61 GAGAGTCCTG TTTGGTTTAA GCAACCTCTG TTTCTCATAA ACTCCATAAA AACAGGAATA 

121 CTCTTTGTTT CTAGCATAAC CAAAAGATTT AGTGAATTGA AAACAATGTT CCCTTAGA.GT 

181 ATAGGTCTAA TAACCCCGAA AATA.TTA.CCA TGA.TACTGAG CATTTGTAAG TATCTCATAG 

241 CA-TGTA.GTAT CCATAGTCCA TCAATGAGAG ACACA.TTTAA CATGATTTTC ATTAATCA.GG 
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Figure 21 (continuedV> 

301 TGGAAAAGAC ATGACAAjCAT TCACAGGCAC TGCACAGAAC ATAGTGGTCC ACCTTGCACA 
3 61 TATTTCACTA AACTAGGTTT ATCTATTTTG TTGCTTTCTC TAACATCTCT GCAATGAAGC 
421 AGGTCAACAG TGCCACATAT CCTTTACTTA ACCTAAGGAA CACAAAAAAT TTTCTACATA 
481 TATCCTGGTT AGAGAGTGCT TAAAATAAGT TTTCCAAGAA TGGAAAAGAA ATGTTCTGAC 
541 TTAACAATTA AGACAGTATT TATTTAAAGC AAGAAATATG AGGCACACAA GAAAATATTT 
eOl TGGGAAGAAA CCATTTGGTG AACAATATTT CAAATAAAAA TAGACAAACA TAGTTAATTG 
661 TAAAACATAT GTTTGACCAG CCCTTCTTTT CAATAGGCTT AATGTGAATA AAATGTTAAA 
721 GATTCTCTTT GGGTGGCTGC AAATTGTCCA CGAATAAGAC AAAATATAAA AATAAGGACT 
781 GAGTCTCACA AAATGAAAAG GAAATATATT CAGAAAGAGA ATCTTGAGAG AATGTGTTGT 
841 CACAAATTAA AGAAAACCTG TGGTGAATGA CATCCTGAGG CCTGAGCTAT TACTGACATT 
901 TAAGATAAAG GTAACTGTAT ACATTTGTCC CATTGAGGGG ACAAGAAAGC TGCTCTCATG 
961 TTCAGCTCTA TAATTCTTGC CTTAAACAAC TTAAATAGAA TGATTTAAAA TATGGAGCTG 
1021 TCCATGGACC TTTGAAATAT AAAATAGTCA AGCAACTTAT CAAGGAATTA CAGATTCCTT 
1081 GATACTAACA CAGGTAAATC CCACACGTGT TTTGAGACTA CATTTGCTGG GATTTTATTG 
1141 ATGTAATAGG TCACATGTTT TTCGGGCCAA TGTTGCTGTT ATTCGGTTAC TTCAAGAGAA 

12 01 TAGTGGCAAC TGATGCTATG TATTCTAGGG GTTTGAAGTG ATGTTTCATG ATTGAAATTT 
1261 GTAAAAGAAT AACATCATCA TTCTTAACAA TAGAACATAT AAAGTCACAC AGAAGTGACA 
1321 GTGTTTAAGC TGTACTATTG ATCAAAGAAA TTTATTACCT TCAGTT-TCAA TGGAAATAAT 

13 81 TACTGATAAT ACAAACATGT GTGAACACAC ACTAATCCTA TCCAAATGCA CAGTGATACA 
1441 CAGAAAATAT TAGCAAGTAG AATGCAATAT TTATATAACG ATTGTATTTA TCAATCAATT 
15 01 GTATGTATCA ATATATGGGC TATTTTCTTA CACATGATTT TATTCAAATT TACTCTAATC 
1561 ATTGTTGAAC CATTTAGAAA AGGCATACTG GCAACTTTTC CTTACCTCAT CCAGCTGGGC 
1621 AAAAGTCCCA GTGTGGAGTA AAGGATGCAA GATTTCCTGC TCTGTTAAGT ATAAAATAAT 
1681 AGTATGAATT CAAAGGTGCC ATTCTTCTGC TTCTAGTTAT AAAGGCAGTG CTTGCTTCTT 
1741 CCAGCACAGA TCTGGATCTC GAGGAGCTTG GCGAGATTTT CAGGAGCTAA GGAAGCTAAA 
1801 AGCCGCCACC ATGAAAGCCA TCTTAATCCC ATTTTTATCT CTTCTGATTC CGTTAACCCC 
1861 GCAATCTGCA TTCGCTCAGA GTGAGCCGGA GCTGAAGCTG GAAAGTGTGG TGATTGTCAG. 
1921 TCGTCATGGT GTGCGTGCTC CAACCAAGGC CACGCAACTG ATGCAGGATG TCACCCCAGA 
1981 CGCATGGCCA ACCTGGCCGG TAAAACTGGG TTGGCTGACA CCGCGCGGTG GTGAGCTAAT 
2 041 CGCCTATCTC GGACATTACC AACGCCAGCG TCTGGTAGCC GACGGATTGC TGGCGAAAAA 
2101 GGGCTGCCCG CAGTCTGGTC AGGTCGCGAT TATTGCTGAT GTCGACGAGC GTACCCGTAA 
2161 AACAGGCGAA GCCTTCGCCG CCGGGCTGGC ACCTGACTGT GCAATAACCG TACATACCCA 
2221 GGCAGATACG TCCAGTCCCG ATCCGTTATT TAATCCTCTA AAAACTGGCG TTTGCCAACT 
2281 GGATAACGCG AACGTGACTG ACGCGATCCT CAGCAGGGCA GGAGGGTCAA TTGCTGACTT 
2341 TACCGGGCAT CGGCAAACGG CGTTTCGCGA ACTGGAACGG GTGCTTAATT TTCCGCAATC 
2 401 AAACTTGTGC CTTAAACGTG AGAAACAGGA CGAAAGCTGT TCATTAACGC AGGCATTACC 
24 61 ATCGGAACTC AAGGTGAGCG CCGACAATGT CTCATTAACC GGTGCGGTAA GCCTCGCATC 
2521 AATGCTGACG GAGATATTTC TCCTGCAACA AGCACAGGGA ATGCCGGAGC CGGGGTGGGG 
2581 AAGGATCACC GATTCACACC AGTGGAACAC CTTGCTAAGT TTGCATAACG CGCAATTTTA 
2641 TTTGCTACAA CGCACGCCAG AGGTTGCCCG CAGCCGCGCC ACCCCGTTAT TAGATTTGAT 
2701 CAAGACAGCG TTGACGCCCC ATCCACCGCA AAAACAGGCG TATGGTGTGA CATTACCCAC 
27 61 TTCAGTGCTG TTTATCGCCG GACACGATAC TAATCTGGCA AATCTCGGCG GCGCACTGGA 
2821 GCTCAACTGG ACGCTTCCCG GTGAGCCGGA TAACACGCCG CCAGGTGGTG AACTGGTGTT 

2 881 TGAACGCTGG CGTCGGCTAA GCGATAACAG CCAGTGGATT CAGGTTTCGC TGGTCTTCCA 
2941 GACTTTACAG CAGATGCGTG ATAAAACGCC GCTGTCATTA AATACGCCGC CCGGAGAGGT 
3001 GAAACTGACC CTGGCAGGAT GTGAAGAGCG AAATGCGCAG GGCATGTGTT CGTTGGCAGG 
3061 TTTTACGCAA ATCGTGAATG AAGCACGCAT ACCCGCTTGC AGTTTGTAAG GTATAAGGCA 
3121 GTTATTGGTG CCCTTAAACG CCTGGTGCTA CGCCTGAATA AGTGATAATA AGCGGATGAA 
3181 TGGCAGAAAT TCGCCGGATC TTTGTGAAGG AACCTTACTT CTGTGGTGTG ACATAATTGG 
3241 ACAAACTACC TACAGAGATT TAAAAAACCT CCCACACCTC CCCCTGAACC TGAAACATAA 
33 01 AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG 

3 361 CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT 
3421 GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCGAT CCCCGGGTAC 
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are 22: Nucleic acid sequence of the SV40/APPA+intron plasmid (SEP ID NO:6V 

LOCUS SV4 0/APPA 5421 bp DNA CIRCUIAR SYN 14-APR-2000 

DEFINITION Ligation of SV4 0 promoter/enhancer into CAT/APPA+intron 
ACCESS ION S V4 0 / APPA 
REFERENCE 1 (bases 1 to 5421) 

SOURCE synthetic construct. 

ORGANISM synthetic construct 

artificial sequence. 
KEYWORDS SV4 0 promoter /enhancer , acid glucose-1 -phosphatase ; appA gene; 

periplasmic phospho anhydride phosphohydrolase; artificial 
sequence ; 

AUTHORS Golovan, S., Forsberg, C.W., Phillips, J. 

JOURNAL Unpublished. 

DEFINITION E. coli periplasmic phosphoanhydride phosphohydrolase (appA) 
gene, 

ACCESSION M5fi708 L03370 L03371 L03372 L03373 L03374 L03375 

VERSION M5870e.l GI: 145283 

SOURCE Escherichia coli DNA. 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae; 

Escherichia . 

REFERENCE 1 (bases 40 1337) 

AUTHORS Dassa , J . , Marck , C - and Boqfuet , P . L , 

TITLE The complete nucleotide sequence of the Escherichia coli gene appA 

reveals significant homology between pH 2.5 acid phosphatase 
and glucose-1 -phosphatase 

JOURNAL J- Bacteriol. 172 O) , 5497-S500 (1990) 

MEDLINE 90368616 

FEATURES Location/Qualifiers 
Source 4 0 1337 

/organisms "Escherichia coli" 
/db^xref = " taxon : 562 " 

sig_peptxde 40.. 105 

/gene= " appA" 
CDS 40 1337 

/ gene = " app A " 
/standard_name= "acid phosphatase/phytase " 

/ transl_table»ll 

/products "periplasmic phosphoanhydride phosphohydrolase" 
/protein_id«"AAA72086 .1" 
/db_xref ="GI:145285" 

/translation="MKAILIPFLSLLIPLTPQSAFAQSEPELKLESWIVSRHGVRAP 

TKATQIJ^QDVTPDAWPTWPVKLGWLTPRGGELIAYLGHYORQRLVADGLLAKKGCPQS 

GQVAI I ADVDERTRKTGEAFAAGLAPDCAI TVHTQADTSS PDPLFNPLKTGVCQLDNA 

NVTDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSNLCLKREKQDESCSLTQALPS 

ELKVSADNVSLTGAVSLASMLTEIFLLQQAQGMPEPGWGRITDSHQWNTLLSLHNAQF 

YLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFIAGHDTNLANLGG 

ALELNVTTLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSLVFQTLQQMRDKTPLSIiNT 

PPGEVKLTLAGCEERNAQGMCSLAGFTQIVNEARI PACSL " 

mat_peptide 106 1334 

/gene="appA" 
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Figure 22 fcontinuedV, 



mutation 



mutation 



mutatxon 



/products="periplasmic phosphoanhydride phosphohydrolase" 



replace {46.. 48, "gcg changed to gcc") 
/gene=" appA" 

/ s tandard_name ="A3 mutant " 

/riote=" created by site directed mutagenesis" 

/phenotypes" silent mutation" 

replace (1320. .1322, " ccg changed to ccc") 

/gene="appA" 

/ standard_name= " P428 mutant" 

/notes** created by site directed mutagenesis" 

/phenotype=" silent mutation " 

replace ( 1323 1325, " gcg changed to get") 

/gene = " appA " 

/standard_name=" A429 mutant" 

/note=" created by site directed mutagenesis" 

/phenotype=" silent mutation " 



DEFINITION Plasmid pBLCAT3 (bases 2200 to 4924) 
ACCESSION X644 09 

X64409.1 GI:58163 

synthetic construct . 
synthetic construct 
artificial sequence. 

1 (bases 2200 to 4924) 
Luckow , B . H . R - 
Direct Submission 

Submitted (06-FEB-1992 ) B.H.R. Luckow, German Cancer Res 
Center, Im Neuenheimer Feld 280, W-6900 Heidelberg, FRG 

2 (bases 2200 to 4924) 
Luckow, B. and Schut2,G. 

CAT constructions with multiple unique restriction sites 



VERSION 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AXTTHORS 
TITLE 



for 

regrulatory 

JOURNAL 
MEDLINE 
COMHENT 
experiments 



FEATURES 

source 



the functional analysis of eukaryotic promoters and 
elements 

Nucleic Acids Res. 15 {13), 5490 (1987) 
87260024 

Promoterless CAT vector for transient trans feet ion 

with eukaryotic cells. Allows the analysis of foreign 
promoters and enhancers . 

Location/ Qualifiers 

2200 to 4924 

/organisms "synthetic construct" 
/db xref ="taxon:3263 0" 



SV40 t intron 



polyA^signal 



CDS 



1380. . 1993 
/note="SV4 0 signals" 
1990 . .2230 
/note="SV40 signals" 
complement (3471. .4317) 
/codon_start=l 
/ trans l_table=ll 
/gene= "Amp " 

/product^ "beta- lactamase" 
/protein_id= "CAA4 57 53 .1" 
/db xref ="GI ;58165" 
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Figure 22 (continued'^: 



SV4 0 promoter / enhancer 5023. .5402 

/note= " SV40 signals " 



BASE COUNT 1413 a 1321 c 1331 g 1355 t 

ORIGIN 

1 CGAGATTTTC AGGAGCTAAG GAAGCTAAAA GCCGCCACCA TGAAAGCCAT CTTAATCCCA 
61 TTTTTATCTC TTCTGATTCC GTTAACCCCG CAATCTGCAT TCGCTCAGAG TGAGCCGGAG 
121 CTGAAGCTGG AAAGTGTGGT GATTGTCAGT CGTCATGGTG TGCGTGCTCC AACCAAGGCC 
181 ACGCAACTGA TGCAGGATGT CACCCCAGAC GCATGGCCAA CCTGGCCGGT AAAACTGGGT 
241 TGGCTGACAC CGCGNGGTGG TGAGCTAATC GCCTATCTCG GACATTACCA ACGCCAGCGT 

3 01 CTGGTAGCCG ACGGATTGCT GGCGAAAAAG GGCTGCCCGC AGTCTGGTCA GGTCGCQATT 
361 ATTGCTSATG TCGACGAGCG TACCCGTAAA ACAGGCGAAG CCTTCGCCGC CGGGCXGGCA 
421 CCTGACTGTG CAATAACCGT ACATACCCAG GCAGATACGT CCAGTCCCGA TCCGTTATTT 

4 81 AATCCTCTAA AAACTGGCGT TTGCCAACTG GATAACGCGA ACGTGACTGA CGCGATCCTC 
541 AGCAGGGCAG GAGGGTCAAT TGCTGACTTT ACCGGGCATC GGCAAACGGC GTTTCGCGAA 
601 CTGGAACGGG TGCTTAATTT TCCGCAATCA AACTTGTGCC TTAAACGTGA GAAACAGGAC 
661 GAAAGCTGTT CATTAACGCA GGCATTACCA TCGGAACTCA AGGTGAGCGC CGACAATGTC 
721 TCATTAACCG GTGCGGTAAG CCTCGCATCA ATGCTGACGG AGATATTTCT CCTGCAACAA 
781 GCACAGGGAA TGCCGGAGCC GGGGTGGGGA AGGATCACCG ATTCACACCA GTGGAACACC 
841 TTGCTAAGTT TGCATAACGC GCAATTTTAT TTGCTACAAC GCACGCCAGA GGTTGCCCGC 
901 AGCCGCGCCA CCCCGTTATT AGATTTGATC AAGACAGCGT TGACGCCCCA CCACCGCAAA 
961 AACAGGCGTA TGGTGTGACA TTACCCACTT CAGTGCTGTT TATCGCCGGA CACGATACTA 

1021 ATCTGGCAAA TCTCGGCGGC GCACTGGAGC TCAACTGGAC GCTTCCCGGT CAGCCGGATA 
1081 ACACGCCGCC AGGTGGTGAA CTGGTGTTTG AACGCTGGCG TCGGCTAAGC GATAACAGCC 
1141 AGTGGATTCA GGTTTCGCTG GTCTTCCAGA CTTTACAGCA GATGCGTGAT AAAACGCCGC 
1201 TGTCATTAAA TACGCCGCCC GGAGAGGTGA AACTGACCCT GGCAGGATGT GAAGAGCGAA 
1261 ATGCGCAGGG CATGTGTTCG TTGGCAGGTT TTACGCAAAT CGTGAATGAA GCACGCATAC 
1321 CCGCTTGCAG TTTGTAAGGC AGTTATTGGT GCCCTTAAAC GCCTGGTGCT ACGCCTGAAT 
13 81 AAGTGATAAT AAGCGGATGA ATGGCAGAAA TTCGCCGGAT CTTTGTGAAG GAACCTTACT 
1441 TCTGTGGTGT GACATAATTG GACAAACTAC CTACAGAGAT TTAAAGCTCT AAGGTAAATA 
1501 TAAAATTTTT AAGTGTATAA TGTGTTAAAC TACTGATTCT AATTGTTTGT GTATTTTAGA 

15 61 TTCCAACCTA TGGAACTGAT GAATGGGAGC AGTGGTGGAA TGCCTTTAA.T GAGGAAAACC 
1621 TGTTTTGCTC AGAAGAAATG CCATCTAGTG ATGATGAGGC TACTGCTGAC TCTCAACATT 

16 81 CTACTCCTCC AAAAAAGAAG AGAAAGGTAG AAGACCCCAA GGACTTTCCT TCAGAATTGC 
1741 TAAGTTTTTT GAGTCATGCT GTGTTTAGTA ATAGAACTCT TGCTTGCTTT GCTATTTACA 
18 01 CCACAAAGGA AAAAGCTGCA CTGCTATACA AGAAAATTAT GGAAAAATAT TCTGTAACCT 
18 61 TTATAAGTAG GCATAACAGT TATAATCATA ACATACTGTT TTTTCTTACT CCACACAGGC 
1921 ATAGAGTGTC TGCTATTAAT AACTATGCTC AAAAATTGTG TACCTTTAGC TTTTTAATTT 
1981 GTAAAGGGGT TAATAAGGAA TATTTGATGT ATAGTGCCTT GACTAGAGAT CATAATCAGC 
2 041 CATACCACAT TTGTAGAGGT TTTACTTGCT TTAAAAAACC TCCCACACCT CCCCCTGAAC 
2101 CTGAAACATA AAATGAATGC AATTGTTGTT GTTAACTTGT TTATTGCAGC TTATAATGGT 
2161 TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG CATTTTTTTC ACTGCATTCT 
2221 AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG TCTGGATCGA TCCCCGGGTA 
22 81 CCGAGCTCGA ATTCGTAATC ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC 
2341 ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG TGCCTAATGA 
2401 GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG 
24 61 TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG 
2521 CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG 
2581 GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA 
2641 AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG 
2 701 GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG 
2 7 61 AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC 
2 821 GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG 
2 881 GGAAGCGTGG CGCTTTCTCA ATGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT 

2 941 CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCG.ACCGCTG CGCCTTA.TCC 

3 001 GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC 
3 061 ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGeTG CTACAGAGTT CTTGAAGTGG 
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Figure 22 (continued): 

3121 TGGCCTAACT ACGGCTACAC TAGAAGGACA 

3181 GTTACCTTCG GAAAAAGAGT TGGTAGCTCT 

3241 GGTGGTTTTT TTGTTTGCAA GCAGCAGATT 

33 01 CCTTTGATCT TTTCTACGGG GTCTGACGCT 

3361 TTGGTCATGA GATTATCAAA AAGGATCTTC 

3421 TTTAAATCAA TCTAAAGTAT ATATGAGTAA 

3481 AGTGAGGCAC CTATCTCAGC GATCTGTCTA 

3541 GTCGTGTAGA TAACTACGAT ACGGGAGGGC 

3601 CCGCGAGACC CACGCTCACC GGCTCCAGAT 

3 661 GCCGAGCGCA GAAGTGGTCC TGCAACTTTA 

3 721 CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT 

3781 ACAGGCATCG TGGTGTCACG CTCGTCGTTT 

3841 CGATCAAGGC GAGTTACATG ATCCCCCATG 

3901 CCTCCGATCG TTGTCAGAAG TAAGTTGGCC 

3 961 CTGCATAATT CTCTTACTGT CATGCCATCC 

4021 TCAACCAAGT CATTCTGAGA ATAGTGTATG 

4081 ATACGGGATA ATACCGCGCC ACATAGCAGA 

4141 TCTTCGGGGC GAAAACTCTC AAGGATCTTA 

4201 ACTCGTGCAC CCAACTGATC TTCAGCATCT 

4261 AAAACAGGAA GGCAAAATGC CGCAAAAAAG 

4321 CTCATACTCT TCCTTTTTCA ATATTATTGA 

43 81 GGATACATAT TTGAATGTAT TTAGAAAAAT 

4441 CGAAAAGTGC CACCTGACGT CTAAGAAACC 

4501 AGGCGTATCA CGAGGCCCTT TCGTCTCGCG 

4561 CACATGCAGC TCCCGGAGAC GGTCACAGCT 

4621 GCCCGTCAGG GCGCGTCAGC GGGTGTTGGC 

46 81 TCAGAGCAGA TTGTACTGAG AGTGCACCAT 

4741 AGGAGAAAAT ACCGCATCAG GCGCCATTCG 

4801 CGATCGGTGC GGGCCTCTTC GCTATTACGC 

4861 CGATTAAGTT GGGTAACGCC AGGGTTTTCC 

4921 GCCAAGCTTT ACACTTTATG CTTCCGGCTC 

4981 AATTTCACAC AGGAAACAGC TATGACCATG 

5041 AAATAACCTC TGAAAGAGGA ACTTGGTTAG 

5101 GGAATGTGTG TCAGTTAGGG TGTG6AAAGT 

5161 AAAGCATGCA TCTCAATTAG TCAGCAACCA 

5221 GCAGAAGTAT GCAAAGCATG CATCTCAATT 

5281 CGCCCATCCC GCCCCTAACT CCGCCCAGTT 

5341 TTTTTTTTAT TTATGCAGAG GCCGAGGCCG 

54 01 GAGGAGGCTC GAGGAGCTTG G 

// 



GTATTTGGTA TCTGCGCTCT GCTGAAGCCA 
TGATCCGGCA AACAAACCAC CGCTGGTAGC 
ACGCGCAGAA AAAAAGGATC TCAAGAAGAT 
CAGTGGAACG AAAACTCACG TTAAGGGATT 
ACCTAGATCC TTTTAAATTA AAAATGAAGT 
ACTTGGTCTG ACAGTTACCA ATGCTTAATC 
TTTCGTTCAT CCATAGTTGC CTGACTCCCC 
TTACCATCTG GCCCCAGTGC TGCAATGATA 
TTATCAGCAA TAAACCAGCC AGCCGGAAGG 
TCCGCCTCCA TCCAGTCTAT TAATTGTTGC 
AATAGTTTGC GCAACGTTGT TGCCATTGCT 
GGTATGGCTT CATTCAGCTC CGGTTCCCAA 
TTGTGCAAAA AAGCGGTTAG CTCCT^PCGGT 
GCAGTGTTAT CACTCATGGT TATGGCAGCA 
GTAAGATGCT TTTCTGTGAC TGGTGAGTAC 
CGGCGACCGA GTTGCTCTTG CCCGGCGTCA 
ACTTTAAAAG TGCTCATCAT TGGAAAACGT 
CCGCTGTTGA GATCCAGTTC GATGTAACCC 
TTTACTTTCA CCAGCGTTTC TGGGTGAGCA 
GGAATAAGGG CGACACGGAA ATGTTGAATA 
AGCATTTATC AGGGTTATTG TCTCATGAGC 
AAACAAATAG GGGTTCCGCG CACATTTCCC 
ATTATTATCA TGACATTAAC CTATAAAAAT 
CGTTTCGGTG ATGACGGTGA AAACCTCTGA 
TGTCTGTAAG CGGATGCCGG GAGCAGACAA 
GGGTGTCGGG GCTGGCTTAA CTATGCGGCA 
ATGCGGTGTG AAATACCGCA CAGATGCGTA 
CCATTCAGGC TGCGCAACTG TTGGGAAGGG 
CAGCTGGCGA AAGGGGGATG TGCTGCAAGG 
CAGTCACGAC GTTGTAAAAC GACGGCCAGT 
GTATGTTGTG TGGAATTGTG AGCGGATAAC 
ATTACGAATT CGGCGCAGCA CCATGGCCTG 
GTACCTTCTG AGGCGGAAAG AACCAGCTGT 
CCCCAGGCTC CCCAGCAGGC AGAAGTATGC 
GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG 
AGTCAGCAAC CATAGTCCCG CCCCTAACTC 
CCGCCCATTC TCCGCCCCAT GGCTGACTAA 
CCTCGGCCTC TGAGCTATTC CAGAAGTAGT 
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Figure 23, The nucleic acid sequence of the Laina2/APPA transgene used for the eeneration 
of transgenic mice and transgenic pigs fSEO ID NO: 7^ 



LOCUS 

DEFINITION 

ACCESSION 

KEYWORDS 



REFERENCE 

AUTHORS 

JOURNAL 



transgene 17732 bp DNA SYN 14-APR-2000 

Lama-appA cut Xhol., 20623 to Notl.. 17732 

transgene 

parotid secretory protein; acid glucose- 1 -phosphatase; appA 

gene ; 

periplasmic p ho spho anhydride phosphohydrolase; artificial 

sequence; 

cloning vector 

1 (bases 1 to 17732) 

Golovan, S., Forsberg, C.W., Phillips, J. 
Unpubl i shed . 



FEATURES 

DEFINITION M. musculus Psp gene for parotid secretory protein. 
ACCESSION X68699 
VERSION X68699.1 GI:53809 
S0X7RCE house mouse. 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 

Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

REFERENCE 1 (bases 3777 to 5332;) 

AUTHORS Svendsen,P., Laursen, J. , ICrogh-Pedersen,H. and Hjorth,J-P. 
TITLE Novel salivary gland specific binding elements located in 

the PSP proximal enhancer core 
JOURNAL Nucleic Acids Res. 26 (11), 2761-2770 (1998) 
MEDLINE 98256451 

REFERENCE 2 (bases 7147 to 12653; 13952 to 17731) 
AUTHORS Mikke 1 s en , T . R . 

Direct Submission 

Submitted (07 -OCT-1992 ) T.R. Mikkelsen, Department of 

Molecular Biology, University of Aarhus, CF Mollers Alle 
13 0, 800 0 Aarhus, DENMARK 
REFERENCE 3 (bases 7147 to 12653; 13952 to 17731) 
AUTHORS Laursen J, Hjorth JP 

TITLE A cassette for high-level expression in the mouse salivary- 
glands . 



TITLE 
JOtJRNAL 



JOURNAL 
MEDLINE 



Gene 1997 Oct 1; 198 (1-2) : 367-72 
9370303 



F3SATURES 



Location/Qualifiers 
source l.to 12653; 13952 to 17731 

/organism="Mus musculus" 
/ s t rain= " C3H/ As " 
/db_xref ="taxon : 10090 " 
/ chromosomes " 2 " 

/map= "Estimate : 69 cM from centromere" 

/ c lone = "Lambda YPl, Lambda YP3 , Lambda YP7 " 

/ c lone_l ib= " Lambda - PHAGE ( Lambda L4 7 . 1 ) " 

/ germline 

/note="Allele : b" 



misc_feature 3777-5332 

/gene="PSP" 

/function=" salivary gland specific positive acting 
regulatory region" 
enhancer 7147, .8724 
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Figure 23 (continued): 

/evidence=experiniental 
exon 11778.. 11824 

/gene=**Psp" 
/note="exon a" 
/numberssl 

/evideiice=experiniental 
exon 12626. . 14190 

/gene="Psp" 

/note="exon b fused with exons h and i" 
misc_f eature 12644-12652 

/function=" consensus secfuence for initiation in higher 

eukaryotes " 
misc_feature 13952-13965 

/functions" M13mpl8 polylinker'' 



DEFINITION E. coli periplasmic phosphoanhydride phosphohydrolase (appA) 
gene, 



ACCESSION 
VERSION 
SOURCE 
ORGANISM 



M5870B L03370 L03371 L03372 Ii03373 Ii03374 L0337S 
M58708.1 GI:145283 

Escherichia coli DNA. 
Escherichia coli 



Bacteria; Proteobacteria; gamma subdivision; 
Enterobacteriaceae ; 
Escherichia. 



REFERENCE 1 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 



(bases 12653 .. 13951) 
Dassa,J., Marck,C. and Boquet,P.L- 

The complete nucleotide sequence of the Escherichia coli 
gene appA reveals significant homology between pH 2.5 
acid phosphatase and glucose -1 -phosphatase 

J. Bacteriol. 172 (9), 5497-5500 (1990) 

90368616 



FEATURES 

Source 



sig_jpeptide 
/genes= " appA' 
CDS 



Location/Qualifiers 

12653. .13951 
/organism* "Escherichia coli" 
/ db_xr e f = " t axon : 5 6 2 " 
12653. .12718 

12653 13951 
/ gene = " appA " 

/standard_name="acid phosphatase/phytase" 
/ trans l_table= 11 

/product= "periplasinic phosphoanhydride 

phosphohydrolase " 

/pr ot e in_i d = " AAA7 2 0 86.1" 

/db xref="GI; 145285" 



/translation="MKAILIPFLSLLIPLTPQSAFAQSEPELKLESWIVSRHGVRAP 
TKATQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYLGHYQRQRLVADGLLAKKGCPQS 
GQ VAI I ADVDERTRKTGEAFAAGLAPDCAITVHTQADTS S PD PLFNPLKTGVCQLDNA 
N^H'DAILSRAGGSIADFTGHRQTAFRELERVLNFPQSNLCLKREKQDESCSLTQAl.PS 



BNSDOCID <WO 0064247A1_L> 
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Figure 23 (continued): 

EIiKVSADNVSLTGAVSXi7^MLTEIFIjLQQAQGMPEPGWGRITDSHQWOTI.L 

YliliQRTPEVARSRATPLLDLIKTAI^TPHPPQKQAYGVTLPTSVLFIAGHDTNIA^ 

ALEIiNWTLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSLVFQTIiQQMRDKTPIiSIi^ 

PPGEVKLTLAGCEERNAQGMCSIiAGFTQIVNEARIPACSL " 
mat^peptide 12 719 13 94 8 

/geiie= " appA" 

/product= "periplasmic phosphoanhydride 
phosphohydrolase " 



mutation 



mutation 



mutation 



replace (12659 . . 12661, "gcg changed to gcc") 
/ gene= appA" 

/ standard__naine= " A3 mutant " 

/notesi" created by site directed mutagenesis" 
/citation- [3] 

/phenotype=" silent mutation" 

replace (13934 13936 , " ccg changed to ccc") 
/ gene a= " appA " 

/standard_name=" P428 mutant" 

/note=" created by site directed mutagenesis" 
/citations [3] 

/phenotype=" silent mutation " 

replace (13 937 1393 9, " gcg changed to get") 

/genes appA" 

/stauadard_name=" A429 mutant" 

/note=" created by site directed mutagenesis" 
/citations [3] 

/phenotype=" silent mutation " 



BASE COUNT 4719 a 4125 c 4168 g 4719 t 

ORIGIN 

1 TCGAGAGTAT CTTTGTCAGC TGTGCCTCCA ACAAAGGGGT ACTGTTGCCC ACATAGAAAG 
61 ATCTAAACTA ATTAATTAAT CCCTCACCCG CAAATCTTTC AGTCACTAAG TTAGCACGAT 
121 TGTTGAACAA GTTCTCCAAA GGAGAGATAC AGATGAGTGC GTATAGGGTG GACCTGGCTG 
181 CTGAGGAGAC ACCTGCATCT GACTAAGAAG AGCCACGGTG TTAGTTGAAT GGTGTGGAGT 
241 AGGGTGGTTC TGTGGGACAG TAGAAAATCG AGAGGCATGT GCCGTTTAGT GAACTGATGG 
301 AAGCTACCCC AAACGACAGA GATTGTCAGT CAGGCCAATC CGTTTCGAGT TTGATGGGCA 
361 GCCGGACAGT GAGACAGACA CACCTACTCA GTTGGAGGAA GGATGAGAAC AATGGCCAGC 
421 AGGGATTGAG AGACCCTGAC AGGCGCAAGG CCCTAACACA CACACCTACC ACCTCACTTG 
481 ACAAAGCTGC CAAAGACCAA AGACTTGTTC TCCATTAGAA ATGACAGCTG GCTTGACCCG 
541 ACAGCATAAT AAGCAGAGTG TACTCTGATT GGAGAACTTT AATGTGTTTC ATTCAGTATT 
601 ATAAAAGGAC AGTATTACAG ATTTTGTTGT ACACTGCTGT TACATGTGGG GCAGTGTGTC 
661 TTTAAGTAGG GTAAAGTACT CTTTAAAAAT GGGTCCTAGA TATTTTTTCC TTTAACTCAA 
721 GTCTCTTACT GTTTAAATGA TTTTTATTTT GTTTAATATG GAGGAAAAAG AAGCGTAAAT 
781 GGACAATATA TATTTAGAGA AAGATGGTTA GCTGTCAGAA AAATATGCAA ATCAAAATCA 
841 CACCAAGACT GCAGCACACC CCTGTCAGAT GGCTGTGATC AAGAAAATAA ATGACAATGA 
901 GTGGTGGTGA AGATGTACTA AAGGGAAACA CACACACACA CACACACACA CACACACACA 
961 CACACTGGAG CAA.CCACTGT GGAAATCAGT ATGAATGGTC CTCAAAAACC TGAAGATAGA 
1021 GCGGGGCGTG GTGGO^TACA CTTTTATTCC CAGCACTGGG GAGGCAGAGG CAGGTGGATC 
10 81 TCTGAGTTCC AGGCCAGCCT GGTCTATAGC ACAGGTTCTA GGACAGCCAG GGCTACACAG 
1141 AAAAACCCTG CCTTGATTAA ACCAAACCAA ACCAAACCAA ACCAAACCAA ACCAAACCAA 
12 01 ACCAAACCAA ACCAAACCAG ACCAAACCAA AACACTGAAG ATAGAACTTC AGTATTCCAT 

12 61 TCCTAGATAT ATACCCAATG GAGA.CTAAGT CAGCAAGACA CCTGCACAGC CATGTTCACT 

13 21 ACTACACTGT TCACCACAGC CAGGCTGTGG AACCAGCCTG AGTeTCCATG A.TAAATGAAT 
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Figure 23 (continued): 

1381 GGATAGGTAA CTTTCAAGGT AAATGGACTC TGCTGTGTAC ATGCCTCACA TTCTGTTTAT 
1441 TCATTTTTCT TTATGAGGTG TCCATTCAGG AGTCACATGG TAGTTCTATT TTCAGTCTTC 
1501 TGAAGATACT ACACTGGTCC CCACAGTTTA CACTTTTATC AGCAGTGAAT AAGGGTTCCT 
1561 CTATCCTTAC CATCATTTGT TGTAATTTTT CTTGATGACC CTCTTTCTGA CAGGGATAGG 
1621 ATGTAATATC AGTGTGAGGA AGTACAACTT GTTTTCTAAG TATTTATTGG CCCCTTGCAT 
1681 TTCTTCTTTT GAAAACT6TC GGTTCCTGAC ATCTGCTCAG GTATTCATTG GATGTTGTTT 
1741 CTTTGGTGTT TGAGTTCTTA TGAATTCTAG ATGTTAAATC CCTGCCTGTG GTTCTCTCCC 
1801 ATTCTGTAGG CTGCCTCCTC ACCCTGGCAA TTGTTGTCCT TGTTTTGCAG AAACTTTTGA 
1861 CTTCATGGAA TCTCATTTGT CAGTTTTCCC TCCTCTGCTA TAGCCTGAGC TAATGCACTG 
1921 GTTTTTACAG AGCCCTGGTC TATGCCTTTA TCCTCCTCTG GCAGCTTCGG AGTTTCATTT 
1981 CTTACATTTA GATCTTTGAT CCACTTTGAA CAAGTTTTGG AGCAGGGTGA GAGATACGAA 
2041 TCTAGTTCCA TTCTTCCATA TGTGATCCTA GTTTACATAG CATCGTTGGT TGAAGAGGTT 
2101 TTATTTTATT TTTAAATAAT GTGTCATAAA AAACGAGGTG GTTGTAGCAG TGTGGATTTG 
2161 TTTCTTTGTC CTTTGATCTA CAGGTCTTGT TTTGTGTCAG TCTCATGATG TTTTATTGCT 
2221 ATGGCTCTGT CATACAGTCT GAGGTCAGGT ATTGTGATAT ACCTTCAGTA TTGCTCCCTC 
22 81 AGACTCAGGT TTGCTTTGGC CAGGAGTCAT CTTACTCAGT GCTCTTAGAG CTCCCCCAGC 
2341 ATGTAGCTGC TACTATTCTT AGTTGATAAA TCAGGAAACT GGGGCTCAGA GAGATTAACT 
2401 GTCTTGAACT ACTTCTGGGG AGGTGAAACG TGGAGACACT AAACTGTGTT TACCCTGTAC 
2461 TGCTCCAGTA GCTGTCGGGT GCTGGGCTAC AGCAAAGCAC CTATACTATA TATTACTCAG 
2521 GAGGTGGAAA AACTCAGCCT CCCTTGGGGT TCCCAAGCTC CCAGGTGTCC AGTCACTGCT 
2581 GGAAACCTCA TGGAGTCTGA AAGGAAGGGT TGAGGGTACA TGGGGCAGCG ATGAGGAGCC 
2641 TGGGGCTGGG ATCTCCCAAA CACCTGGATA TCCAGATGCC ACTGGGTCAG GGGGAGTTGG 
2701 GAACAGAGTT GGGATGTCCA TGGACCTGTG ACAAGGCCAG GGCCAGGGGG AGGATAACTC 
2761 TGGCTTTACT AATTTGCGAA AGTCCTTAGC TTAGCAGCAG TTGTCTGGGA GCACAGAGGG 
2821 GCCTTCTGTA AGAGGCTCAG GCAGTGCCGC TCTGTAGGCG AAGGTCTTCT CCATGTTCCC 

2 881 CATGGTGGTT CTTGATGAAA GAGACAGTCC TTGGCTCCAA ACTGGTTTAT TGATTGTTCA 
2941 TTGTGGAAAA TGGGTGCACA CCACCTTCTC AGGGTGGACC AGAGATCAAA TACCTTTTGC . 

3 001 AGGGAGGAAT ATCTGGGAAG GGACGCTTAC TGGCTAAACC CTCAGGGCCT CTAGATACAT 
3 061 CATTAGCATG GAGAACTCTG TTCTGGGCTA CATGACCACA GGCCACATTT CCACAAGCCA 
3121 CATGTGGGAA GTGTGGCACA TGTTCTAGGC CAGGAATCTG GTAGGGAGCG TGGAGCCACC 
3181 TACCATCCCA QGTGGGTGCC TGGGTGCCAG GGACCCTGAA CCCGCTCAAC CTTACCAAGT 
3241 TTCCTGGCAG GGTCCACTGT CCTACACAGA AGCTGGAGGA GGTGTGAGGG TTGTGTCTTT 
3 301 GTGGAATGTC CCATGCTGCT TGGGGCTCAG TTTCTCCACC TGTACCTCAT TGGTTTGGGT 
3361 ATAAAAAGTG GGGATACTTT ATTATTCTCT GACTCGGTCC TGAGGAAAAA GCATCGTGGC 
3421 AGTCCAGGAA CCACACCCTG AGGTTCCTGC ACTGAAGGGA CTCCCTAAGT CTCTGGAGTC 
3481 TCTCCCCTTC ACAGAGCTGC CAAAGTCTAG GTTCTTTTGA GGATAACAGA GCCATGCTTG 
3541 GTAAGCAGAC AACAGCATTT GTTTACTCAA CCTTCTTTTG TCAGCTCCCT CTTCATAAAC 
3 601 AAGTTGAGAC ACCATGCTGG CTTGAGGAAG ACTTCTAAAG CCAGACAACT GTGCAAGGAA 
3661 GAAGAAGAAG GGGCAAGTGG AGTTAGCCTG GATGTAGCCC TCAAAGTCTC CAGAGACCAG 
3721 CCATGAAGGC TCAAGTGGAG GGCAAGACCT GCAGCAGCCA AGCATCTGGC AGGAGAGGAT 
3781 CCTGGGAACC CCTCTACCAT GACACACATT CTTCCTGCAG GTCACACTTA ATAGGCCATT 

3 841 TCTTATTTGG ATCTATCATG GTGTTCTGTG CGAGATTAAT GAGGTGTTAT GCTGCGAACA 
3901 GAAAGTTATA TAAAAACAAG TCCCCCCCCC TTGTCACTGC TGCTAAGAAT GTA6CAGAAA 
3961 TTGTCTCAAG TGTCTCTCTA ATCAGAAACA ATAAAGGTCT CCTTGGATTC AAGCCCTCCA 
4021 GTTTCCTCCT TCCTTGCTGA GCCTTGGACA CCCATACAAA CCTCCTGGAT GCTACAGCTC 
4081 TGGGCAGAGA CTCCAAGGTG GGGAGAGACT GATGGTACAA AAGCAAAATA CTTGTTTGGG 
4141 GGTACACCCA CTCCTCTGCC TGTGTGGTTC CTGCAGTCAG TCCTGCAGAC AGGCCCTCAG 
4201 TGGGTCTTCC ATGGGCAACA CGCAGAGGGA GGCAATGGAT GGGAATACCC ACACCCTGGT 
4261 TAGTTTACCC CGGCCATGCT CTCTGCTCTT CATCCCTCCT CTGCCCTCTG CCACGGCTTT 
4321 CTCTGCAGGA ATCATATCTT CATATTGGCC CACAGGTGTT CTCCTCACCC TAGCTATGAT 
4381 GTTTACTTTA GAGTGACCTT AGCAGGGCTG GTGGGAATGA GTTCTAGAAG GCTCACGGAG 
4441 ATGCTAGGGA AGAAACGTCT TCTAACTACT GAGGTTACTA AGTTCCTGGT GGTTGTCTCT 
4501 GCCTTTCCCT TGTTAAAGTC ACCTTGAAGT TAGTGCAGAA GAAATCAGAG CCCAGTCACA 
4561 GAGTAAATAT GGTCCTGAAG ATTTCCTTTG AGTGCCCAGA ATCCATGACA TTTCAAGAGC 
4621 CCTCTTTGTA CCTTAAGTCA TTTGGGGTTG TATCTTCTGC TTGATGTATG TGTGTGTGTT 
4681 TATCAAAGAG TGAGATGGTT ACATAAGAGG TGCTCTAAAG GACAGAGAGG ATTTGCAATT 

4 741 GTGGCATGTG ACATCCTCAG GCCTTGCTCT GGTGCCAGGA GGAACTGATG CAGAAAAGAG 
4 801 TAAGAGGTCA TTTCCTGGAG GCTGTCACTA TAGAGGAGAT CTTAGAGTGC ATTCCCTCCT 
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Figure 23 (continuedV 

4 861 CCAGGCCCTG CCTGAGGATA GACATGTGCT GACTGCAACT GAAACAGAGG CTTGGGATGG 
4921 AGAGTTAGGT TCACAGAAGG GAGGGTGGGA GATGGATGCT TGCTGGGTTC TGGGTCTCAT 
4981 CACCAGCTCC TGACCACCCG GTCAGCCCAT GTGCTTATTC CATAGCTTTC TTTTGCTATG 
5041 TTTACTCAGT GTGGTGTTTG TTGGGACCCA GCAGAAGCCA GTCCCAGGCT GACAGCTGTG 
5101 GATACACAGG GCAGCATGAG GGTCCTCAGC CTGAAGCAGT CAGGCTGGCA GAAGAGAAAG 
5161 ACCAGCACAC ATTCCTTCAA CCAACTATGT CTTGAAAAAC AAACATATTA TATCACATAT 
5221 ATTGCATTTA TGAGACAGCT AAAATGTACT CGGGTAGCAT GACTCCAGGT GGGGATATCX 
5281 GCAAGTGCCA TGAGTGGCAG AGGGACAGCC AATGTGAGGC AAGAAGGAAT TCTGGCTCAA 
5341 CACAGCTTAG CTCCCTGGTG TTGGTTCAAA CTTTGAGAGT TTGACCACAA GCACTTTATT 
5401 TTTGACATAT TTAAACAGAG CACAACTTTG GGAAAAAGTT TTCTTATGAA AATTATCACA 
5461 ATAAAGCTTA AGGCATGACT ACATTAAAAT GCCTTTGCAA AGTATATGTG CCCTCTTCCA 
5521 CAAGAATGGT TCTATTGACT GAGAAATAAT GTTCAGGATA AAGATCCAGG AAGAAAAGAT 
5581 CAGGGATAAG TAT^AATACTA AACTCTTTTG CAAAGTACAT AGACCCTCTT TCATAACAAT 
5641 GGGTTCTATT GACTGACAAG CACTGCTCAG GAGTTGGGAA AGAGTCTAGC ATAAGCACGA 
5701 TAGCCTGGAG ACTCTAGTGA GGTCTAGTCT TACAGACAGC AAAAATCACC AGGTTACAAA 
5761 CTACATTCAT TTCCAGTTTT CTGATCAGGC ACAGGTATGA ATCCCTTCTG TTGAAGAGAA 
5821 AAGTCCATGT GTTTAAAATA TCTGGTTTCT CCAGTGCTAT TAGCGAGAAG ACTTGAGCCC 

5 881 TATACAACTC CCACCTGGAG TGACATCCTG TCTTCATGGT ATATTACATA CCTAGACACG 

5 941 CTCATCTCAC AGACTTAGGA CTTTGTCTTC TGATCTCCAT TTCTGATCCC ACTTCCACCT 
6001 TTGCCTTGAT AGTGTCATTT TCTTCACTGC CTTGGTGACA ACCATGTTAT CCTCTGTGTA 
6061 TTTGAGTGTT ACCATTTTCA GATTTTACCT GTATGCAAGA TCACACAGTC TTTGTCTTTC 
6121 TGTCTGGATG CATGCTAATC TCTACACAAC AACCCTTCCC CGTCACTCAG ATCTTCCZTCC 
6181 ATTAACACAT ACATGGTGCT GAAGAGGCTA GGGAGCTTCC CTTCAGTGGG GAGCTAGCTG 
6241 GCTATTGGGC CTTTTTGACT GTCCAGGAAG GCCCCCAATT GCTGAGACAA GAACTTAGAT 
6301 TCTTCATTAT TGACTCTAAC TCATGTATCA AGCAGAAGCT AATGAATAGT TATCAACAGG 
6361 ATCAGAGGTT CCAGTGTAAG ACACTTTGAC ATGAAAGAAC GGAGGAAGGA CAGATGGATG 
6421 CATAAAAGCA GGACCACTGC CCCAGQAAGG TCCTGGAAAC TGATGCAGGG CAAAGGACAG 
6481 GTTATAAACC AAATCTTAGG GAGTCAGGAA GAGCACAGAG GAGCTCAACC AACTGACCAC 
6541 TGCTTAGGGG CTACCAACCC AATCCTCCCT GTGGGAACAG CTAAGCTATC AGCCAAGGGT 
6601 AATAAACAGG CAGGACCTGT GGATGACATG GAGAGCATAG GGACCCTGGG TCCAGCCTTT 
6661 AGCACCTGCA CTCTCAGGAT ACTCCACCAT TGTGTCTTAG AGAGCCTAGG GATACTGGGT 
6721 CCAGCCTTTG GTACCTTCAC TCTCAGGGTA CCCCATCACT GTGTCTTGGA GAGCCTAGGC 
67 81 ACCCTGGGTC CAGCCTTCAG TACCTGCGCT CTCAGGACAC CCCACCATTG TCTCTTGCCC 

6 841 CGTCTCTTCT TCCTCTTCCT CCCTTTCATT GTCTCTTCTC TGTTTCTTTC TTGACTCTCC 
6901 TTTCCCCTCA CACCCTCACT CTAGTTCTCC CCTTCCCTCT CTGCATCACC CTATTCTCTC 
6961 TGTGGTCCCT CCACTTTCCT TTATCTCTCA TGCTTCTCTC CTCCCTCAAA TACTTGTCAC 

7 021 CCACTATACT TCAGGGGCCA GCTCTAGTGA CAAAGCTGTT T^TAGCAAGA CTCTCAGATC 
7 081 TCCAACGGCT CAGAGGAGCC AGACCCACCA AGAACTCTCT CCAGGTCCAA TTTCAGGTTC 
7141 CTTCGAAAGC TTTCAGCAAA TGCTCAGGGA ACATGCCACT AACAAGAAGA TGCAAATTCC 
72 01 AGTTGAGAGT GGGAAAGGCC CTTGCGTAGG TCCCATCTTC CAGGCCAAGG TCAGAGGGGC 
7261 TCTGTGTAAT CCGGATTGAC AGGGCTCAGA ACAATGTTTT GTTTTTAAGG TTTATTTATT 
7321 TTAGGTGTTA GTGTCTTTGC TTGCATGACC TTATGTGCAT CATGTGTGTG CAGGTTCCTG 
7381 ATGACAGTAG AGGAGGGCTT TGAATCCCTG GGGATAGGAA GTTACAGGAA ATTATAAGCT 
7441 GCTTTGTGGG TCTTCTAGCT TTCCCAACAG AAGTGAATGC TCTTCACCAC TGAGCCATCT 
7501 CTCTAGGCCC AAGAGACATT GCTTTATGGA TATAATTGTG TGTGTGTGTC AACATTGAGG 
7561 AAAGGGAAAT AAAAAAAAAA CTTCAGCCGC TAAGGTTGTA CAGTTTCACT AATTGCTACT 
7621 TTTAGTTGTG ATAAAATGGC AGGTGCTTCA ACATTTATAT ATACAAAAAC TTCCCTGCTG 
7681 GTGGTTCAAC TGTGAGAACT GGGGTAAGTG GGTGAGTTCT CTTTTTCTGT CTCTGTCTCT 
7741 GTCTCTCTCC TTCCATTCTT TCTTAAAGGA AATAAACATT GCAGCTGGGT TATAGCTCAT 
7801 CAATATGGAA GTTACAGAAG TGAAAAAAGG CATTGCCTTG GTGGGTGGTG TTACCAGCTG 
7861 ATTTTTGGTT GTCCTGCAAG GAGGTCTGGG GACTGGCTGC TCTGTCTCTG TCTGTATGAG 
7921 TGAGGGAAGT CTGGGGAGCA GATTCCCTAA CCTTCAGCCT GGCCTGGTTC CTQAGTGAAC 
7981 CCAGCCTCTC TGGTCCTAGT AGCTTTTTCC AAACAGGAAT CTGAGTGGTG ACAGGGAACA 
8041 AGTACCAGCC CATTGCTTAA GTGCCAGGGT TAGTGAGGGC AGGAAGCTGC CATAGCTGGG 
8101 ATTAGTAGTT GTATTGGATG TAGGAAGTCC TATCCTGGGA CAGCTAATCC TTAATGCTTC 
8161 ACTGGAGATT TTCAATGA.GA. AATTTATCCC ACGGCCCATA TGGCCCCATC CTTTTGTCTC 
8221 "CAACAGCCAA GTATTTTCCA TTAGAGGAGA CTTCCTGTAC ACTTGATGGA. TGCTCATTCC 
82 81 AAGGTGACTT GGGGCAGTCA GTACAGACTT GGGATGACCT CTGACASGCT AACCTCTCCC 
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Figure 23 (continuedV> 

8341 CAACAAGGGC CCTCTATGTT TGCTATGTAA TGTAATGTCA GACATTGTCA GGAGTGTCCG 
8401 CAGCACAGCC TGCCCAGTGT GAGGGCTCTC ATAGGTTTCC CACTGTCTTA TCTACACAGG 
8461 GATAACGAGG AGGTAAGCTG CAGTTCCCAG TCTCACTTCA CAGAGGAAGA GATAACCCCA 
8521 TCCCAGGTCA TGTAGCCAGC AGTGGAAAGA ATGAGGATTT GAACTCAGGT CTTCCAAGTC 
8581 CCATTGATAG CATCTCCTCA CAAGTCCCTT GCCACCCTCA CGATGCCTTA GACACTTGCC 
8641 TGCCCTTTAT ACTAAGGAGA TGCAGGTACA AGGGGTTTAC CCATGTAGCA GCTGAGGCAG 
87 01 CTGGGGATAG ATACCAGCAG CAGGCCTGAT GTCACCACTC TAACTCCAGC ATCCCCAGTC 
8761 TGTGTTCCTG GAGTGTGAAA ATCCCTACTT AACAAGATTG TGCAACAGTC CTTGGCTCTG 
8821 TGACCCATAG CTGGAAACAG GATTCTCATT GATTTGTGGA ACATGGTGGC AGCCAGCCAA 
8881 AAAGAGGGTC TGCATACAGA AGACACGTGT GGCAAGGCCA CAGCAGACTC TGACTACCTT 
8941 AGCTTACAGA ATTACAAGGT CATAATGTCC TCTGCTTTGG TCACCTCATG TTAAGGACAG 
9001 GCCCTAATGA AGATGGGGCA GAAGACTGAA GGAATGGCCA ACCAATAACT GGCCCAACTT 
9061 GAGACCCATC CTACAGGCAA GCATCAATTC CTGACACTAC TAATGATACT CTGTTATGCT 
9121 TGCAGACAGA AGCCTAGCAT AACTATCCTC CGAGAGGTCC ACCCAGCAAC TGACTGAAAC 
9181 AGAAAAAGAT ATCCACAGGC AAACAGTGGA TGGAGGTCAG GGACTATTAT GGGAGAGCTG 
9241 TGGGAAGGAT TAAAAACCCT GAAGGGGATA GGAACCCCAC AGGAAGACCA ACAGAGTCAA 
93 01 CTAAGAGACC TGTGGGAGCT CTCAGAGACT GAGCCACCAA CCAAAGAGCA TACACAGGCC 

93 61 GGTCCGAGGC ACCTGGCACG TGTGAAGCAG ACATGCAGCT CAGTCTCCAT GTAGGTCCTC 
9421 CAATAAGCGG TAGCCTGACT GCAGTATCCA ATCCCCAACA GGGCTGCATA GTCTGGCCTC 

94 81 AGTGGGGGAG GATGCCCCTA ATCCTGCAGA GACTTGATGA GTGGAGAGCT ATCCAGGGGG 
9541 AACCCACCCT CTCTGAGAAG GGAATGGGGA TGGGGGAGGG ACTCTGTGAA GAGGGGACAA 
9601 GGACAAACAA GAACCTCAAA. TAGGTCAGGC CCTAAAGGCT TGCTAAGTAG CAGTGGCCCA 
9661 GCTCTGTCCT GTTCCTCAGC CCAAGGCTCA GCTCCCACCT GTTTCTGTGT TTTTCTGGCT 
9721 TTTCATGGGC CTAGGACTTG GTGACCAGTT CAAACAATGG GGCCTGTGGA AGACACAATA 
9781 TACAAGACTA GGGACATTCC TGTTCTGCTG ACTATCCATA GCCTGATGTA GGTGGAAGGA 
9841 CCCAATCACT GGATTTCTAC CCTTGCACAA CCTTGACAGC TGAGGGCCTC TCAGAAACCT 
9901 ATTTCTTCCA CTGAAAAATG AGACTCTCAA ATGAACGTCG TGACAATCAT CAGGCTTATT 
9961 AAAGAGGTGT ATCTAACCTG AATGGCAAGC AGACAGCAGG CAAATGTCTG TATCAACCTC 

10021 TAGGAAGGAC AAGAACTGCT CACTGCTGCC CCCCAGGAGG CCATTTGCTG AAACAGCTGC 
10081 TCTCCTGCTG GTGCACAGGC CCTGCCTTCT CATTGCAGCC ACAGCCCCTT CCTGTCTGAA 
10141 CCTCCTGTCA GGTCACTGGG AAACAGATCA AGATGGAACA GGACAGCTCC TGATGGTAAA 
10201 TAAAAAACAG TGGTCATGGC TATTCATAGG GGTTTATGCT TCTTCAGTCC ACACTGTGAA 
10261 GAGCTGTGGG CATGAACCAC AGTGTTCGAG GTAGAGTTGG GGTTCTGAAA TTCACAGTGG 
10321 GGTGAGCTCA GTAAATGTGA GCTGGAGGTC ACTCGTGAGA CACACAGTCC TGCTGCTTCT 
103 81 GTTCCCAATA TCCTGAGGAG ACGACACATC TACTTTGTTC AGAGGCCACA GTCTAGTTGA 
10441 CCTGAGAGTT ACCAGTTTCT TATTTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG 
10501 TGTTGTTCGT GTGTGAGTGC AGGTGCACAT ATGATAGCGT ACACGTTGAG GTCAGAGGAT 
10561 AACTATCAGG CGTTGTCCCC TCCTACTTTT CCTCGGACTC TGGAGAACAA ACATGGGTCC 
10621 TTATTCCAGG GGAGCAAGTC GCTGTTGGCT GACACATCTT GCTCACATAC ATTTTACCTA 
10681 GACAATGGAG CCTCCATCAG AGTATTACTT TAGCTCCTCA CCGATGGCAA TGCACCACCT 
10741 CTCTACCCAC ATAGGAGTTG GGTCTCCACA CACCCCCACA CCCCCTTCAC CAAAACGTTT 
10801 TCAGTTACTT TATCTGGTAA AGTTCATCAG AGAATGAAGC CAGTATTAAG AACATGGAAT 
10861 CATTTGGGAA CCTGGATCTA GCAATACCCC ACCCTAGATG GAGTTGCTGA GTTTTCACCT 
10921 CAGATTATAA TTCCCCCCTA GCTTCTATGG TTTATTCTGA AACCAGGGGA ACTCGATTCC 
10981 TCCCTTTGGA CCACAGACAT CCTGGCTTGT GAATTCACAT GTCATCTACT GCTAATCCAT 
11041 TGGTAGTATG TGGCTCACAG AGACACACTA CAGTCATGGC CAATGTCAAG GTAGGACAGA 
11101 TGTGAATCAT TCCCCCAGTC CTGCTGTTTT CATGACTAAC CCTCCTCAGC ACAGTGACCA 
11161 TGAACCTACT TTTCCCCTCC TTTTATTTTT AGAATTGCTG GAATTTTCTA TTTTGAGAAA 
11221 TAATAGCCTT GGGCAGCATT AAACAAAATC ATCTAGAAAG CTGGTTTAAA ATACAGATGG 
11281 TTGAGTCAGT GAAAGAGTGA GGAATGTCAT TATTGGCCCC TCACAGAGGC TGGCTCACTC 
11341 CAGCAGAGGT GGTTGAAGCT CTTGGACACG GGTCAGGTGC ATAGGAAAGG TNGTCTGGGA 
114 01 CACTGAGAAC CACAATTGAA CAAACAGAAC TGTTGGCTTT TTTTTTTTTA AATGAGTTCT 
114 61 CAAAAAATGA CTGGCTAGCT TAGGCAAATA CTTCGAGCCA ACCCAACAGA ACATTCTTCC 
11521 ATTGA.TTCAT TCTGGATCTT CTTTCTAGAC AATACTGAAC TGACCCCTTG TTGGCAGTCT 
11581 CAAGTTTGAC AACA.TAGGGC TTTGAACTTG GCACAAGGTC CATCACTGTC ACCCAAGCAT 
11641 CCTGGGTGAC CTTTGGGTTG GAATATCTTG GCTAACCTTA GATATTTTCT TTGGAGTATC 
117 01 TTTAGAACAT CCAGGAAATA GGGCTTGATT CTCATCCTGG GACCACAATA TAAGTCACCC 
117 61 TAGAATCCCA GGAGATCGTG CAGAGAAACA AGGATCTCTC TCGTGTG0.3: CCTTCTTCAA 
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Fi2ure 23 (continuedV 

L1821 AGCAGTGAGT AGTGACTCCA CTAAACTGAG TTCCCATCTG AGAGTCCACA GGAGGCTTTG 

118 81 GGGCAAGAAG CAGAGGGAAG GCACTGTTTG TGTTGGTAAA GTTTTGACTC TAACAAATTT 

11941 GAAGACATAG ATGACATTGT GTCAGACTAA CAACAACCTA GACTCATGTG GGTTCTGTTT 

12 001 AGGGATCAGA TTTTATTCAT CAATGACTTG TCTTAGTGTA TAGAGAAAGG CTTCCTACTG 

12 061 GAGTGTAGGC TCAATAATGA CAGAAGAGAT AGCTATTTCC CCTAGGGACT GTGCTGCTCC 

12121 AAGTTTGGTG GAGAAAGGCA GTGGGGAACC TAGATGTGCT CTCTGGGGAG GGGGTCTGAA 

12181 GCTGGCTTCA TAGAAGGTGT GAAGTTTTGC TGAAACATCT AAACAGAATT ATAGCTTAGG 

12 241 AAAGTGAGCA GGCAAGGCAG GGAA.TGTGTT GCATATGTAT ATGTACATGA ATATATTATG 

123 01 TTATAGATAC ACACACATTT GAACCTCATT TGCAGATGAC AGAAAATAGG TTATTTTGCC 

123 61 TCTCTTAACT GCTAAGCACA ATGACTTCCA GTTCCATCCA TTTCCTGAAA TGCCACAATT 
12421 TCATTTTTCA TTGTGGCTGA ATAAAATTCC ATTGCAGACT GGGCCCTACT TCATCCACTC 

124 81 CTGAGGGCAG GCATATCCCC TGGCTCCATT TCTTACCTAT TGTGAAGAGA AGTGCAACTG 
12 541 TCTTGTTGAA AGGCAAGCGT GAGAGAGGCA GGCACTAATT GTGGGTTTTT GTTTCITCTT 
12 601 CCTGCTATGA CTCTCCATTT GTCAGAACCA AAGATCGATA 7VAAGCCGCCA CCATGAAAGC 
12 661 CATCTTAATC CCATTTTTAT CTCTTCTGAT TCCGTTAACC CCGCAATCTG CATTCGCTCA 
12 721 GAGTGAGCCG GAGCTGAAGC TGGAAAGTGT GGTGATTGTC AGTCGTCATG GTGTGCGTGC 
12 781 TCCAACCAAG GCCACGCAAC TGATGCAGGA TGTCACCCCA GACGCATGGC CAACCTGGCC 
12 841 GGTAAAACTG GGTTGGCTGA CACCGCGCGG TGGTGAGCTA ATCGCCTATC TCGGACATTA 
12 901 CCAACGCCAG CGTCTGGTAG CCGACGGATT GCTGGCGAAA AAGGGCTGCC CGCAGTCTGG 

12 961 TCAGGTCGCG ATTATTGCTG ATGTCGACGA GCGTACCCGT AAAACAGGCG AAGCCTTCGC 

13 021 CGCCGGGCTG GCACCTGACT GTGCAATAAC CGTACATACC CAGGCAGATA CGTCCAGTCC 
13 0 81 CGATCCGTTA TTTAATCCTC TAAAAACTGG CGTTTGCCAA CTGGATAACG CGAACGTGAC 
13141 TGACGCGATC CTCAGCAGGG CAGGAGGGTC AATTGCTGAC TTTACCGGGC ATCGGCAAAC 
132 01 GGCGTTTCGC GAACTGGAAC GGGTGCTTAA TTTTCCGCAA TCAAACTTGT GCCTTAAACG 
132 61 TGAGAAACAG GACGAAAGCT GTTCATTAAC GCAGGCATTA CCATCGGAAC TCAAGGTGAG 
13 3 21 CGCCGACAAT GTCTCATTAA CCGGTGCGGT AAGCCTCGCA TCAATGCTGA CGGAGATATT 
13381 TCTCCTGCAA CAAGCACAGG GAATGCCGGA GCCGGGGTGG GGAAGGATCA CCGATTCACA 
13441 CCAGTGGAAC ACCTTGCTAA GTTTGCATAA CGCGCAATTT TATTTGCTAC AACGCACGCC 
13501 AGAGGTTGCC CGCAGCCGCG CCACCCCGTT ATTAGATTTG ATCAAGACAG CGTTGACGCC 
13 561 CCATCCACCG CAAAAACAGG CGTATGGTGT GACATTACCC ACTTCAGTGC TGTTTATCGC 
13621 CGGACACGAT ACTAATCTGG CAAATCTCGG CGGCGCACTG GAGCTCAACT GGACGCTTCC 
13681 CGGTCAGCCG GATAACACGC CGCCAGGTGG TGAACTGGTG TTTGAACGCT GGCGTCGGCT 
13741 AAGCGATAAC AGCCAGTGGA TTCAGGTTTC GCTGGTCTTC CAGACTTTAC AGCAGATGCG 
13 801 TGATAAAACG CCGCTGTCAT TAAATACGCC GCCCGGAGAG GTGAAACTGA CCCTGGCAGG 
13861 ATGTGAAGAG CGAAATGCGC AGGGCATGTG TTCGTTGGCA GGTTTTACGC AAATCGTGAA 
13 921 TGAAGCACGC ATACCCGCTT GCAGTTTGTA AGGTACCCGG GGATCACAAC TTGCCCTCTG 

13 981 AAGAGGAAGA ACAGAAGGAT GCCACAACTC TCCTGCTGGC TACTCTCCAG TGGTTTCATC 

14 041 TTACTTCTGA TGGCATTTCC CTCTAGAAAG TGCTACTATC ATCCACACAT TTCTACCTGA 
14101 GACCACCCAA AGGACCCTCC CAAATTCTCT TCCTCTCTGA GTAGTCTCCA CACCTGTTAC 
14161 CACCATCCCA GAATTAAAAT CCTAACTGCA CTCTGGCGTG TGACTTGCCT CAGTCCTTGC 
142 21 AATAAGAGTT GTTGGCAGTG CCAGGCGTGG TGGCGCACGC CTTTAATTCC AGCACTTGGG 
14 2 81 AGGCAGAGGC AGGCGGATTT CTGAGTTCGA GGCCAGCCTG GTCTACAGAG TGAGTTCCAG 
14 341 GACAGCCAGG GCTATACAGA GAAACCCTGT GTCGAAAAAC CAAAAAAAAA AAAAAAAGTT 
144 01 GTTGGCAGAG TGTGGGTTAT ATACCAGGTG GAGATTTCAA ATGAGTGGCT GAAGCTGTAG 
144 61 CCAGAAGGAA CTTAGAGGAT AGCTCATAAC TTAAAAAGAA ATGTAGAGAG TAGCAGAAAC 
14521 ATTGAGAGAG TGGGCACACA GCCACTGTGT GAATGTGGCA GAACACAATC CAGCCAGCTA 
14581 TACATGCATA AGTGTATATT GGCGCCATCC TGACTGATGA GACACAGGAA AACAGATAGA 
14641 CGGGGTTAGG TGGCCATGGC CTTTCCTGCC TGCCTCTTCC TAAGGGTCAT CTCAAGACCT 
14701 TATGCTCTCT TAACTCTTCC ATTGCTACTT AGCTTCTAGA TATCACCTCC AGATTAGTCT 
14761 CCTTGGGTAC ATCAGTGATC CTGGTGATAT CCAGGGCTTC CTGATTCCAT CTTTGTCATA 
14 821 GAGGCTGCAA CTAAAGAGGT CTTCTTAATA CTTCACACCC TGATGCCAAA AGGAAGACAC 
148 81 AGAAGTTCAC AGAGGTGAAG TGATTCATGT AGGACATACA GTGAGCAAGC ATCAGGGTCC 
14 941 GGATTATCTG ACTCTACTCT AACTTTTATG TAAATGTGCT TTATGCCATT AACACTGTCA 
150 01 TTCCTGTGCT TCAGCTCTGG GAGACTCCCA AGCACTCTTA GGCACAAGCC ACAATTAAGG 
15061 GACTCTGACA CTCTGCATTG ATTAATTAGC ATGGTGGTCT CTATGTTTCC AGATTCATGA 
15121 TTGTTTCACT TTCCATATAG GCTATGAAGG GTGTGAGGAA ATTTTTTGGG GACAGAATTG 
15181 'gAGGCAATCC ACCTCTCTCA GGAAGCCTCT ATCTGGAAAA GCTT.ACAACT CAGGGACA-GT 
152 41 AACTGTAGGC CCAGTCCTTG GTGTCCAAAA TGGGTTTTAT GGTTTGAATC^GCAAAGCCT 
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Figure 23 fcontinuedV 

153 01 TCCATGTGCT CAAAGGTTTG AACATGGAGC 
15361 TTGAGACTGG ATGCTCTTTG GTCCCATGTT 
15421 GGCATGCTAC CAGCTACCAC AGACTATGCC 

154 81 TAGACTTGTA TCTCCTAAAA ATGGAATCAA 
15541 TTTCTGTTAA GTGTTTGGTC ACAGGGACAA 
15601 GAGTTGAGGT TCATTGCTCT AGCAAGTTGG 
15661 ATAAGAGACA TGTAGAAGAG TCTGAAGCTG 
15721 AATAGTTTAA TACACCATGG GAATTGTGAA 
15781 AAAACGTGAG CATGTGGCGT GTGAGAGGGC 
15841 GAAGCCATTC GGCTACGTTA GGGAACGTGT 
15901 CTGAATGAGG CCAAATTTTA AAGGAGTGGA 
15961 CAGACCACCA CTCAGGCTAT GCCGTGTTTG 
16021 TTGTGAAATT CCAGAGCAAT TATCAGAGCA 
16081 GGTGTGGGTC CCTAAGTGGA TGGTGCATAA 
16141 GATAATCCAA AATATCAGCA ATGTGGAATG 
16201 TAGAACTTTG CTCATGGCTG TAATAAATAG 
16261 GTCTGAGTTA CGGTTCCAGG GCAAACATTC 
16321 AGCCAAAGGT CAGCTGGTCA CATTGCATCA 
16381 AGGATACAGG TTATAAAACC TCACTGTCCA 
16441 TTTACCTTCT AAA6ATTTTA 6TCTTCAAAA 
16501 AAACAAATGA GCCTTTGTGG GGCATTTCAC 
16561 CCTGTGTGCA GTAGGAA6TG TGGCCTCTGT 
16621 TCTATCTGAG GGACCCTATG AAGATTCAAC 
16681 TGCTACCAAT TTGACATTTG TAGACCTGCT 
16741 CTCCCAACTT TCCAACCCAT ATTCCACATT 
16801 AGGAGAGAAG GAAGGTTAGA AGAGAAAGTG 
16861 GATTAGGGGC AAGTCCAATC GTCATTGTCA 
16921 GCAAATCAGA AACAGCAAAA GCAGCCAACA 
16981 GTAGCGTGGG AGCAGTCACT ACTGGTCTTC 
17 041 AAATTCCGTA ATTTTTTCCC CACCACCTGA 
17101 CAGCTGGCAA AAATCACATC TCTCCTAGAG 
17161 GCAATCTGAA GCATCTCAAT ATCCCACACC 
17221 CATAACTGTT TTTTTTTTCC AATTTTTTAT 
172 81 CTATCCCGAA AGTCCCCTAT ACCCTCCCAC 
17341 TTTTTGACCC TGGAGTTCCC CGGTACTGGG 
174 01 CTCTTCCCAG TGATGGCCGA CTAAGCCATC 

174 61 CTCTGGGGGT ACTAGTTAGT TCATATTGTT 
17521 GCTCCTTGGG TACTTTGTCT AGCTCCTCCA 

175 81 ACTGTGAGCA TCCACTTCTG TATTTGACAG 
17 641 TATCAGGGTC CTTTCAGCAA AACCTTGCTG 
17701 TGATTATGGG ATGGATCCAC TAGTTCTAGA 

// 



CTCCTCCTGG TAACACTGTA TTGGAGGCTT 
TTGCTACATC ATCTGTCAAG ATATGACCCA 
TCTCCAGCTT TCATGTTCTC CCCACCATGA 
AGCAAACTTT TCCTGCATTA AGTTTTTTTT 
GAAAACACTC AATACAGATA ATTAGTACCA 
ATCAAATTTT TAGGGCTTTG GAACTGATTT 
TGGGCXACAG AAGTGTCACC AGTTTTTAAG 
AATCAGAATG CTCACACAAA GGCAGACAGG 
ATAAGAAGGA ACCTAGGGGG AAATGAGCTA 
GTGGCTGTGC TTGGCCCATG CCCTGGCAAT 
CTAACTCGAT TGTCAGAGAA AATATCAAGA 
TGACCGACCA GCTACTCTTA GCCAGCTCTA 
TGAAGATACA TACAGTTTAG TGAAGIAAGG 
ATCTATGTAG GTGATGCCTA AGTGACACTT 
TCTTCCAAGG AGACCTGTAG ACACACATTT 
CTAGCTAGAA ATCATTTCCT GAAGAGGTTA 
AGTGATGGCA AGGAAGGCAT TGCAG7CAGG 
AGAGTAGAGA GTCAGAGTGT GAGTAGAAAG 
CTCTCAGCAA TCCATTTTCT CCTAAAAGGC 
CCAGTACCAG TAGCCTGGGA ACAAAAGTTG 
ACTTAAAACA GGGCATCACC TAGGAGGAGC 
GTCAGGAATG CTCAGGCTAA TAAGGGGTCC 
AAGTAGTTGT GAGAATTCCC TGTAAATGGA 
ATTGTGTGCT TCTTTATTGG GCTCTCCCAT 
AATCCCTTCC ACCACCATGC AACACTAGGT 
GGTATAGATC TATTTAGACT ACTTCCTGCT 
GGATACCTCC AACCAGCAAC CAGCAAACCA 
AGGCAGCACT AACCAGCAGG ATTGGGGTCG 
TCATGGCTTT GGCATTAATA CTCTCTCAAG 
AATTCCGTAA TTTTAAATGC AAACTATCTA 
CACAAGACAA ATCATAGTTA CTGGCTATTT 
TGGGATTAAA ACAAAAACAT ATTCACATCA 
TAGGTATTTT CTTTATTTAC ATTTCAAATG 
CTCCCTGCTC CCCTACACAC CCACTCCCAC 
GCATATAAAG TTTGCAAGAC CAAGGGGCCT 
TTCTGCTACA TATGCAGATA GAGACACGAG 
GTTCCACCTA TAGGGTCGCA GACCCCTTCA 
CTGGGGGCTC TGTGTTTTAT CTAATAGATG 
GCACTGGCCT AGCGTCACAT GAGCCAGCTA 
GCATGTGCAA TAGTGTCTGC GTTTGGTGGT 
GC 
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SEQUENCE LISTING 

<110> University of Guelph 
Porsberg, Cecil W. 
Golovan , Sergeui 
Phillips, John P- 

<12 0> Transgenic Animals Expressing Salivary Proteins 

<130> U Guelph - 03 

<140> 
<141> 

<150> 60/130,508 
<151> 1999-04-23 

<160> 7 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 20623 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ijama2/APPA 
plasmid 

<400> 1 

tcgagagtat ctttgtcagc tgtgcctcca acaaaggggt actgttgccc acatagaaag 60 
atctaaacta attaattaat ccctcacccg caaatctttc agtcactaag ttagcacgat 120 
tgttgaacaa gttctccaaa ggagagatac agatgagtgc gtatagggtg gacctggctg 180 
ctgaggagac acctgcatct gactaagaag agccacggtg ttagttgaat ggtgtggagt 240 
agggtggttc tgtgggacag tagaaaatcg agaggcatgt gccgtttagt gaactgatgg 300 
aagctacccc aaacgacaga gattgtcagt caggccaatc cgtttcgagt ttgatgggca 360 
gccggacagt gagacagaca cacctactca gttggaggaa ggatgagaac aatggccagc 420 
agggattgag agaccctgac aggcgcaagg ccctaacaca cacacctacc acctcacttg 480 
acaaagctgc caaagaccaa agacttgttc tccattagaa atgacagctg gcttgacccg 54 0 
acagcataat aagcagagtg tactctgatt ggagaacttt aatgtgtttc attcagtatt 600 
ataaaaggac agtattacag attttgttgt acactgctgt tacatgtggg gcagtgtgtc 660 
tttaagtagg gtaaagtact ctttaaaaat gggtcctaga tattttttcc tttaactcaa 720 
gtctcttact gtttaaatga tttttatttt gtttaatatg gaggaaaaag aagcgtaaat 780 
ggacaatata tatttagaga aagatggtta gctgtcagaa aaatatgcaa atcaaaatca 840 
caccaagact gcagcacacc cctgtcagat ggctgtgatc aagaaaataa atgacaatga 900 
gtggtggtga agatgtacta aagggaaaca cacacacaca cacacacaca cacacacaca 960 
cacactggag caaccactgt ggaaatcagt atgaatggtc ctcaaaaacc tgaagataga 1020 
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gcggggcgtg gtggcat:aca cttttactcc 
tctgagttcc aggccagcct ggtctatagc 
aaaaaccctg ccttgattaa accaaaccaa 
accaaaccaa accaaaccag accaaaccaa 
tcctagatat atacccaatg gagactaagt 
actacactgt tcaccacagc caggctgtgg 
ggataggtaa ctttcaaggt aaatggactc 
tcatttttct ttatgaggtg tccattcagg 
tgaagatact> acactggtcc ccacagttta 
ctatccttac catcatttgt tgtaattttt 
atgtaatatc agtgtgagga agtacaactt 
ttcttctttt gaaaactgtc ggttcctgac 
ctttggtgtt tgagttctta tgaattctag 
attctgtagg ctgcctcctc accctggcaa 
cttcatggaa tctcatttgt cagttttccc 
gtttttacag agccctggtc tatgccttta 
cttacattta gatctttgat ccactttgaa 
tctagttcca ttcttccata tgtgatccta 
ttattttatt cttaaataat gtgtcataaa 
tttctttgtc ctttgatcta caggtcttgt 
atggctctgt catacagtct gaggtcaggt 
agactcaggt ttgctttggc caggagtcat 
atgt:agctgc tactattictt agttgataaa 
gtcttgaact acttctgggg aggtgaaacg 
tgctccagta gctgtcgggt gctgggctac 
gaggtggaaa aactcagcct cccttggggt 
ggaaacctca tggagtctga aaggaagggt 
^ggggc^^ggg atctcccaaa cacctggata 
gaacagagtt gggatglicca tggacctgtg 
tggctttact aatttgcgaa agtccttagc 
gccttctgta agaggctcag gcagtgccgc 
catggtggtt cttgatgaaa gagacagtcc 
ttgtggaaaa tgggtgcaca ccaccttctc 
agggaggaat atctgggaag ggacgcttac 
cattagcatg gagaactctg ttctgggcta 
catgtgggaa gtgtggcaca tgttctaggc 
taccatccca ggtgggtgcc tgggtgccag 
ttcctggcag ggtccactgt cctacacaga 
gtggaatgtc ccatgctgct tggggctcag 
ataaaaagtg gggatacttt attattctct 
agtccaggaa ccacaccctg aggttcctgc 
tctccccttc acagagctgc caaagtctag 
gtaagcagac aacagcat:tt: gtt:tactcaa 
aagttgagac accatgctgg cttgaggaag 
gaagaagaag gggcaagtgg agttagcctg 
ccatgaaggc tcaagtggag ggcaagacct 
cctgggaacc cctctaccat gacacacatt 
tcttatttgg atctatcatg gtgttctgtg 
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cagcactggg gaggcagagg caggtggatc 1080 
acaggttcta ggacagccag ggctacacag 1140 
accaaaccaa accaaaccaa accaaaccaa 1200 
aacactgaag atagaacttc agtattccat 1260 
cagcaagaca cctgcacagc catgttcact 1320 
aaccagcctg agtgtccatg ataaatgaat 13 80 
tgctgtgtac atgcctcaca ttctgtttat 1440 
agtcacatgg tagttctatt ttcagtcttc 1500 
cacttttatc agcagtgaat aagggttcct 1560 
cttgatgacc ctctttctga cagggatagg 1620 
gttttctaag tatttattgg ccccttgcat 1680 
atctgctcag gtattcattg gatgttgttt 1740 
atgttaaatc cctgcctgtg gttctctccc 1800 
ttgttgtcct tgttttgcag aaacttttga 1860 
tcctctgcta tagcctgagc taatgcactg 192 0 
tcctcctctg gcagcttcgg agtttcattt 1980 
caagt:t:tt:gg agcagggtga gagatacgaa 2040 
gtttacatag catcgttggt tgaagaggtt 2100 
aaacgaggtg gttgtagcag tgtggatttg 2160 
tttgtgtcag tctcatgatg ttttattgct 2220 
attgtgatat accttcagta ttgctccctc 2280 
cttactcagt gctcttagag ctcccccagc 234 0 
tcaggaaact ggggctcaga gagattaact 2400 
tggagacact aaactgtgtt taccctgtac 2460 
agcaaagcac ctatact:at:a tattactcag 2520 
tcccaagctc ccaggtgtcc agtcactgct 2580 
tgagggtaca tggggcagcg atgaggagcc 264 0 
tccagatgcc actgggtcag ggggagttgg 2700 
acaaggccag ggccaggggg aggataactc 2760 
ttagcagcag ttgtctggga gcacagaggg 2 820 
tctgtaggcg aaggtcttct ccatgttccc 2880 
ttggctccaa actggtttat tgattgttca 2940 
agggtggacc agagatcaaa taccttttgc 30 00 
tggctaaacc ctcagggcct ctagatacat 3060 
catgaccaca ggccacat:t:t: ccacaagcca 3120 
caggaatctg gtagggagcg tggagccacc 3180 
ggaccctgaa cccgctcaac ct^t:accaagt 3240 
agctggagga ggtgtgaggg ttgtgtcttt 3300 
tttctccacc tgtacctcat tggtttgggt 3360 
gactcggCcc tgaggaaaaa gcatcgtggc 3420 
actgaaggga ctccctaagt ctctggagtc 3480 
gttcttttga ggataacaga gccatgcttg 3540 
ccttcttttg tcagctccct cttcataaac 3600 
acttctaaag ccagacaact gtgcaaggaa 3660 
gatgtagccc tcaaagtctc cagagaccag 3 720 
gcagcagcca agcatctggc aggagaggat 3 780 
cttcctgcag gtcacactta ataggccatt 3840 
cgagattaat gaggtgttat gctgcgaaca 3 900 
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gaaagttata taaaaacaag tccccccccc trtgtcactgc tgctaagaat gtagcagaaa 3 960 
ttgtctcaag tgtctctcta atcagaaaca ataaaggtct ccttggattc aagccctcca 4020 
gtttcctcct tccttgctga gccttggaca cccatacaaa cctcctggat gctacagctc 4080 
tgggcagaga ctccaaggtg gggagagact gatggtacaa aagcaaaata cttgtttggg 4140 
ggtacaccca ctcctctgcc tgtgtggttc ctgcagtcag tcctgcagac aggccctcag 4200 
tgggtcttcc atgggcaaca cgcagaggga ggcaatggat gggaataccc acaccctggt 4260 
tagtttaccc cggccatgct ctctgctctt catccctcct ctgccctctg ccacggcttt 4320 
ctctgcagga atcatatctt catattggcc cacaggtgtt ctcctcaccc tagctatgat 43 80 
gtttacttta gagtgacctt. agcagggctg gtgggaatga gttctagaag gctcacggag 444 0 
atgctaggga agaaacgtct tctaactact gaggttacta agttcctggt ggttgtctct 4500 
gcctttccct tgttaaagtc accttgaagt tagtgcagaa gaaatcagag cccagtcaca 4 560 
gagtaaatat ggtcctgaag atttcctt:tg agtgcccaga atccatgaca tttcaagagc 462 0 
cctctttgta ccttaagtca tttggggttg tatcttctgc ttgatgtatg tgtgtgtgtt 4 680 
tatcaaagag tgagatggtt acatiaagagg tgctctaaag gacagagagg atttgcaatt 4 74 0 
gtggcatgtg acatcctcag gccttgctct: ggtgccagga ggaactgatg cagaaaagag 4 800 
taagaggtca tttcctggag gctgtcacta tagaggagat cttacagtgc attccctcct 4 860 
ccaggccctg cctgaggata gacatgtgct gactgcaact gaaacagagg cttgggatgg 4 92 0 
agagttaggt tcacagaagg gagggtggga gatggatgct tgctgggttc tgggtctcat 4 980 
caccagctcc tgaccacccg gtcagcccat gtgcttattc catagctttc ttttgctatg 5040 
tttactcagt gtggtgtttg ttgggaccca gcagaagcca gtcccaggct gacagctgtg 5100 
gatacacagg gcagcatgag ggtcctcagc ctgaagcagt caggctggca gaagagaaag 5160 
accagcacac attccttcaa ccaactatgt: cttgaaaaac aaacatatta tatcacatat: 5220 
attgcattta tgagacagct aaaat.gtact cgggtagcat gactccaggt ggggatatct 5280 
gcaagtgcca tgagtggcag agggacagcc aatgtgaggc aagaaggaat tctggctcaa 5340 
cacagcttag ctccctggtg ttggttcaaa ctttgagagt ttgaccacaa gcactttatt 54 00 
tttgacatat ttaaacagag cacaactttg ggaaaaagtt ttcttatgaa aattatcaca 5460 
ataaagctta aggcatgact acattaaaat gcctttgcaa agcatatgtg ccctcttcca 5520 
caagaatggt tctattgact gagaaataat gttcaggata aagatccagg aagaaaagat 55 80 
cagggataag taaaatacta aactcttticg caaagtacat agaccctctt tcataacaat 564 0 
gggttctatt gactgacaag cactgctcag gagttgggaa agagtctagc ataagcacga 5700 
tagcctggag actctagtga ggtctagtct tacagacagc aaaaatcacc aggttacaaa 5 760 
ct:acattcat ttccagtttt ctgatcaggc acaggtatga atcccttctg ttgaagagaa 582 0 
aagtccatgt gtttaaaata tctggtttct ccagtgctat tagcgagaag acttgagccc 5 880 
tatacaactc ccacctggag tgacatcctg tcttcatggt atattacata cctagacacg 5940 
ctcatctcac agacttagga ctttgtc^tc tgatctccat ttctgatccc acttccacct 6000 
ttgccttgat agtgtcattt tcttcactgc cttggtgaca accatgttat cctctgtgta 6060 
tttgagtgtt accattttca gattttacct gtatgcaaga tcacacagtc tttgtctttc 6120 
tgtctggatg catgctaatc tctacacaac aacccttccc cgtcactcag atcttcctcc 6180 
attaacacat acatggtgct gaagaggcta gggagcttcc cttcagtggg gagctagctg 624 0 
gctattgggc ctttttgact gtccaggaag gcccccaatt gctgagacaa gaacttagat 63 0 0 
tcttcattat tgactctaac tcatgtatica agcagaagct aatgaatagt tatcaacagg 6360 
atcagaggtt ccagtgtaag acactttgac atgaaagaac ggaggaagga cagatggatg 6420 
cataaaagca ggaccactgc cccaggaagg tcctggaaac tgatgcaggg caaaggacag 64 80 
gttataaacc aaatcttagg gagtcaggaa gagcacagag gagctcaacc aactgaccac 654 0 
tgcttagggg ctiaccaaccc aatcctccct gtgggaacag ctaagctatc agccaagggt 6600 
aataaacagg caggacctgt ggatgacatg gagagcatag ggaccctggg tccagccttt 6660 
agcacctgca ctctcaggat actccaccat tgtgtcttag agagcctagg gatactgggt 6720 
ccagcctttg gtaccttcac tctcagggta ccccatcacc gtgtcttgga gagcctaggc 6780 
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accctgggtc cagccttcag 
cgtctcttct tcctcttcct 
tttcccctca caccctcact 
tgtggtccct ccactttcct 
ccactatact tcaggggcca 
tccaacggct cagaggagcc 
cttcgaaagc tttcagcaaa 
agttgagagt gggaaaggcc 
tctgtgtaat ccggattgac 
ttaggtgtta gtgtctttgc 
atgacagtag aggagggctt 
gctttgtggg tcttctagct 
ctctaggccc aagagacatt 
aaagggaaat aaaaaaaaaa 
tttagttgtg ataaaatggc 
gtggttcaac tgtgagaact 
gtctctctcc ttccattctt 
caatatggaa gttacagaag 
atttttggtt gtcctgcaag 
tgagggaagt ctggggagca 
ccagcctctc tggtcctagt 
agtaccagcc cattgcttiaa 
attagtagtt gtattggatg 
actggagatt ttcaatgaga 
caacagccaa gtattttcca 
aaggtgactt ggggcagtca 
caacaagggc cctctatgtt 
cagcacagcc tgcccagtgt 
gataacgagg aggtaagctg 
tcccaggtca tgtagccagc 
ccattgatag catctcctca 
tgccctttat actaaggaga 
ctggggatag ataccagcag 
tgtgttcctg gagtgtgaaa 
tgacccatag ctggaaacag 
aaagagggtc tgcatacaga 
agcttacaga attacaaggt 
gccctaatga agatggggca 
gagacccatc ctacaggcaa 
tgcagacaga agcctagcat 
agaaaaagat atccacaggc 
tgggaaggat taaaaaccct 
ctaagagacc tgtgggagct 
ggtccgaggc acctggcacg 
caataagcgg tagcctgact 
^9^99999^9 gatgccccta 
aacccaccct ctctgagaag 
ggacaaacaa gaacctcaaa 



tacctgcgct ctcaggacac 
ccctttcatt gtctcttctc 
ctagttctcc ccttccctct 
ttatctctca tgcttctctc 
gctctagtga caaagctgtt 
agacccacca agaactctict 
tgctcaggga acatgccact 
cttgcgtagg tcccatcttc 
agggctcaga acaatgtttt 
ttgcatgacc ttatgtgcat 
tgaatccctg gggataggaa 
ttcccaacag aagtgaatgc 
gctt:tatgga tataattgtg 
cttcagccgc taaggttgta 
aggtgcttca acatttatat 
g999taagtg ggtgagttct 
tcttaaagga aataaacatt 
tgaaaaaagg cattgccttg 
gaggtctggg gactggctgc 
gattccctaa ccttcagcct 
agctttttcc aaacaggaat 
gtgccagggt tagtgagggc 
taggaagtcc tatcctggga 
aatttatccc acggcccata 
ttagaggaga cttcctgtac 
gtacagactt gggatgacct 
tgctatgtaa tgtaatgtca 
gagggctctc ataggtttcc 
cagttcccag tctcacttca 
agtggaaaga atgaggattt 
caagtccctt gccaccctca 
tgcaggtaca aggggtttac 
caggcctgat gtcaccactc 
atccctactt aacaagattg 
gattctcatt gatttgtgga 
agacacgtgt ggcaaggcca 
cataatgtcc tctgctttgg 
gaagactgaa ggaatggcca 
gcatcaattc ctgacactac 
aactatcctc cgagaggtcc 
aaacagtgga tggaggtcag 
gaaggggata ggaaccccac 
ctcagagact gagccaccaa 
tgngaagcag acatgcagct 
gcagtatcca atccccaaca 
atcctgcaga gacttgatga 
ggaatgggga tgggggaggg 
taggtcaggc cctaaaggct 



cccaccattg tctcttgccc 6840 
tgtttctttc ttgactctcc 6900 
ctgcatcacc ctattctctc 6960 
ctccctcaaa tacttgtcac 7020 
aatagcaaga ctctcagatc 7080 
ccaggtccaa tttcaggttc 7140 
aacaagaaga tgcaaattcc 7200 
caggccaagg tcagaggggc 7260 
gtttttaagg tttatttatt 7320 
catgtgtgtg caggttcctg 7380 
gttacaggaa attataagct 7440 
tcttcaccac tgagccatct 7500 
tgtgtgtgtc aacattgagg 7560 
cagtttcact aattgctact 7620 
atacaaaaac ttccctgctg 7680 
ctttttctgt ctctgtctct 7740 
gcagctgggt tatagctcat 7800 
gtgggtggtg ttaccagctg 7860 
tctgtctctg tctgtatgag 7920 
ggcctggttc ctgagtgaac 7980 
ctgagtggtg acagggaaca 8040 
aggaagctgc catagctggg 8100 
cagctaatcc ttaatgcttc 8160 
tggccccatc cttttgtctc 8220 
acttgatgga tgctcattcc 8280 
ctgacagcct aacctctccc 8340 
gacattgtca ggagtgtccg 8400 
cactgtctta tctacacagg 8460 
cagaggaaga gataacccca 8520 
gaactcaggt cttccaagtc 8580 
cgatgcctta gacacttgcc 864 0 
ccatgtagca gctgaggcag 87 00 
taactccagc atccccagtc 8760 
tgcaacagtc cttggctctg 882 0 
acatggtggc agccagccaa 8880 
cagcagactc tgacl:acctt 8940 
tcacctcatg ttaaggacag 9000 
accaataact ggcccaactt 9060 
taatgatact ctgttatgct 912 0 
acccagcaac tgactgaaac 9180 
ggactattat gggagagctg 924 0 
aggaagacca acagagt:caa 9300 
ccaaagagca tacacaggcc 9360 
cagtctccat gtaggtcctc 942 0 
gggctgcata gtctggcctc 9480 
gtggagagct atccaggggg 954 0 
actctgtgaa gaggggacaa 9600 
tgctaagtag cagtggccca 9660 
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gctctgtcct gttcctcagc ccaaggctca gctcccacct gtttctgtgt ttttctggct 972 0 
tttcatgggc ctaggacttg gtgaccagtt caaacaatgg ggcctgtgga agacacaata 978 0 
tacaagacta gggacattcc tgttctgctg actatccata gcctgatgta ggtggaagga 984 0 
cccaatcact ggatttctac ccttgcacaa ccttgacagc tgagggcctc tcagaaacct 9900 
atttcttcca ctgaaaaatg agactctcaa atgaacgtcg tgacaatcat caggcttatt 9960 
aaagaggtgt atctaacctg aatggcaagc agacagcagg caaatgtctg tatcaacctc 10020 
taggaaggac aagaactgct cactgctgcc ccccaggagg ccatttgctg aaacagctgc 10080 
tctcctgctg gtgcacaggc cctgccttct cattgcagcc acagcccctt cctgtctgaa 1014 0 
cctcctgtca ggtcactggg aaacagatca agatggaaca ggacagctcc tgatggtaaa 10200 
taaaaaacag tggtcatggc tattcatagg ggtttatgct tcttcagtcc acactgtgaa 10260 
gagctgtggg catgaaccac agtgttcgag gtagagttgg ggttctgaaa ttcacagtgg 1032 0 
ggtgagctca gtaaatgtga gctggaggtc actcgtgaga cacacagtcc tgctgcttct 103 80 
gttcccaata tcctgaggag acgacacatc tactttgttc agaggccaca gtctagttga 1044 0 
cctgagagtt accagtttct tatttgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 10500 
tgttgttcgt gtgtgagtgc aggtgcacat atgatagcgt acacgttgag gtcagaggat 1056 0 
aactatcagg cgttgtcccc tcctactttt cctcggactc tggagaacaa acatgggtcc 1062 0 
ttattccagg ggagcaagtc gctgttggct gacacatctt gctcacatac attttaccta 10680 
gacaatggag cctccatcag agtattactt tagctcctca ccgatggcaa tgcaccacct 1074 0 
ctctacccac ataggagttg ggtctccaca cacccccaca cccccttcac caaaacgttt 108 00 
tcagttactt tatctggtaa agttcatcag agaatgaagc cagtattaag aacatggaat 10860 
catttgggaa cctggatcta gcaatacccc accctagatg gagttgctga gttttcacct 1092 0 
cagattataa ttccccccta gcttctatgg tttattctga aaccagggga actcgattcc 10980 
tccctttgga ccacagacat cctggcttgt gaattcacat gtcatctact gctaatccat 1104 0 
tggtagtatg tggctcacag agacacacta cagtcatggc caatgtcaag gtaggacaga 1110 0 
tgtgaatcat tcccccagtc ctgctgtttt catgactaac cctcctcagc acagtgacca 11160 
tgaacctact tttcccctcc ttttattttt agaattgctg gaattttct:a ttttgagaaa 1122 0 
taatagcctt gggcagcatt aaacaaaatc atctagaaag ctggtttaaa atacagatgg 112 80 
ttgagtcagt gaaagagtga ggaatgtcat tattggcccc tcacagaggc tggctcactc 1134 0 
cagcagaggt ggttgaagct cttggacacg ggtcaggtgc ataggaaagg tngtctggga 114 0 0 
cactgagaac cacaattgaa caaacagaac tgttggcttt ttttttttta aatgagttct 11460 
caaaaaatga ctggctagct taggcaaata cttcgagcca acccaacaga acattcttcc 1152 0 
attgattcat tctggatctt ctttctagac aatactgaac tgaccccttg ttggcagtct 1158 0 
caagtttgac aacatagggc tttgaacttg gcacaaggtc catcactgtc acccaagcat 1164 0 
cctgggtgac ctttgggttg gaatatcttg gctaacctta gatattttct ttggagtatc 117 0 0 
tttagaacat ccaggaaata gggcttgatt ctcatcctgg gaccacaata taagtcaccc 11760 
tiagaatccca ggagatcgtg cagagaaaca aggatctctc tcgtgtgcat ccttcttcaa 1182 0 
agcagtgagt agtgactcca ctaaactgag ttcccatctg agagtccaca ggaggctttg 11880 
gggcaagaag cagagggaag gcactgtttg tgttggtaaa gttttgactc taacaaattt 1194 0 
gaagacatag atgacattgt gtcagactaa caacaaccta gactcatgtg ggttctgttt 12 000 
agggatcaga ttttattcat caatgacttg tcttagtgta tagagaaagg cttcctactg 12 060 
gagtgtaggc tcaataatga cagaagagat agctatttcc cctagggact gtgctgctcc 1212 0 
aagtttggtg gagaaaggca gtggggaacc tagatgtgct ctctggggag ggggtctgaa 1218 0 
gctggcttca tagaaggtgt gaagttttgc tgaaacatct aaacagaatt atagcttagg 1224 0 
aaagtgagca ggcaaggcag ggaatgtgtt gcatatgtat atgtacatga atatattatg 12300 
ttatagatac acacacattt gaacctcatt tgcagatgac agaaaatagg ttattttgcc 1236 0 
tctcttaact gctaagcaca atgacttcca gttccatcca tttcctgaaa tgccacaatit: 1242 0 
tcatttttca ttgtggctga ataaaattcc attgcagact gggccctact tcatccactc 1248 0 
ctgagggcag gcatatcccc tggctccatt tcttacctat tgtgaagaga agtgcaactg 12 54 0 
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tcttgttgaa aggcaagcgt gagagaggca 
cctgctatga ctzctccatct gtcagaacca 
catcttaatc ccatttttat ctcttctgat 
gagtgagccg gagctgaagc tggaaagtgt 
tccaaccaag gccacgcaac tgatgcagga 
ggtaaaactg ggttggctga caccgcgcgg 
ccaacgccag cgtctggtag ccgacggatt 
tcaggtcgcg attattgctg atgtcgacga 
cgccgggctg gcacctgact gtgcaataac 
cgatccgtta tttaatcctc taaaaactgg 
tgacgcgatc ctcagcaggg caggagggtc 
ggcgtttcgc gaactggaac gggtgcttaa 
tgagaaacag gacgaaagct gttcattaac 
cgccgacaat gtctcattaa ccggtgcggt 
tctcctgcaa caagcacagg gaatgccgga 
ccagtggaac accttgctaa gtttgcal:aa 
agaggttgcc cgcagccgcg ccaccccgtt 
ccatccaccg caaaaacagg cgtatggtgt 
cggacacgat actaatctgg caaatctcgg 
cggtcagccg gataacacgc cgccaggtgg 
aagcgataac agccagtgga ttcaggtttc 
tgataaaacg ccgctgtcat taaatacgcc 
atgtgaagag cgaaatgcgc agggcatgtg 
tgaagcacgc atacccgctt gcagtttgta 
aagaggaaga acagaaggat gccacaactc 
ttacttctga tggcatttcc ctctagaaag 
gaccacccaa aggaccctcc caaattctct 
caccatccca gaattaaaat cctaactgca 
aataagagtt gttggcagtg ccaggcgtgg 
aggcagaggc aggcggattt ctgagttcga 
gacagccagg gctatacaga gaaaccc^gt 
gttggcagag tgtgggttat ataccaggtg 
ccagaaggaa cttagaggat agctcataac 
attgagagag tgggcacaca gccactgtgt 
tacatgcata agtgtatatt ggcgccatcc 
cggggttagg tggccatggc ctttcctgcc 
tatgctctct taactcttcc attgctactt 
ccttgggtac atcagtgatc ctggtgatat 
gaggctgcaa ctaaagaggt cttcttaata 
agaagttcac agaggtgaag tgattcatgt 
ggattatctg actctactct aacttttatg 
ttcctgtgct tcagctctgg gagactccca 
gactctgaca ctctgcattg attaattagc 
ttgtttcact ttccatatag gctatgaagg 
gaggcaatcc acctctctca ggaagcctct 
aactgtaggc ccagtccttg gtgtccaaaa 
tccatgtgct caaaggtttg aacatggagc 
ttgagactgg atgctctttg gccccatgtt 



ggcactaatt gtgggttttt gtttcttctt 12600 
aagatcgata aaagccgcca ccatgaaagc 12660 
tccgttaacc ccgcaatctg cattcgctca 12720 
ggtgattgtc agtcgtcatg gtgtgcgtgc 12780 
tgtcacccca gacgcatggc caacctggcc 1284 0 
tggtgagcta atcgcctatc tcggacatta 12900 
gctggcgaaa aagggctgcc cgcagtctgg 12960 
gcgtacccgt aaaacaggcg aagccttcgc 13020 
cgtacatacc caggcagata cgtccagtcc 13080 
cgtttgccaa ctggataacg cgaacgtgac 13140 
aattgctgac tttaccgggc atcggcaaac 13200 
ttttccgcaa tcaaacttgt gccttaaacg 13260 
gcaggcatta ccatcggaac tcaaggcgag 13320 
aagcctcgca tcaatgctga cggagatatt 13380 
gccggggtgg ggaaggatca ccgattcaca 1344 0 
cgcgcaattt tatttgctac aacgcacgcc 13500 
attagatttg atcaagacag cgttgacgcc 13560 
gacattaccc acttcagtgc tgtttatcgc 13 62 0 
cggcgcactg gagctcaact ggacgcttcc 13 680 
tgaactggtg tttgaacgct ggcgtcggct 13740 
gctggtcttc cagactttac agcagatgcg 13800 
gcccggagag gtgaaactga ccctggcagg 13 860 
ttcgttggca ggttttacgc aaatcgtgaa 13 920 
aggtacccgg ggatcacaac ttgccctctg 13980 
tcctgctggc tactctccag tggtttcatc 14040 
tgctactatc atccacacat ttctacctga 14100 
tcctctctga gtagtctcca cacctgttac 14160 
ctctggcgtg tgacttgcct cagtccttgc 14220 
tggcgcacgc ctttaattcc agcacttggg 142 80 
ggccagcctg gtctacagag tgagttccag 14340 
gtcgaaaaac caaaaaaaaa aaaaaaag^t 14400 
gagatttcaa atgagtggct gaagctgtag 144 60 
ttaaaaagaa atgtagagag tagcagaaac 14520 
gaatgtggca gaacacaatc cagccagcta 14580 
tgactgatga gacacaggaa aacagataga 14640 
tgcctcttcc taagggtcat ctcaagacct 14700 
agcttctaga tatcacctcc agattagtct 14760 
ccagggcttc ctgattccat ctttgtcata 14820 
cttcacaccc tgatgccaaa aggaagacac 14880 
aggacataca gtgagcaagc atcagggtcc 14940 
taaatgtgct ttatgccatt aacactgtca 15000 
agcactctta ggcacaagcc acaat:taagg 15060 
atggtggtct ctatgtttcc agattcatga 15120 
gtgtgaggaa attttttggg gacagaattg 15180 
atctggaaaa gcttacaact cagggacagt 15240 
tgggttttat ggtttgaatc tgcaaagcct 15300 
ctcctcctgg taacactgta ttggaggctt 15360 
ttgctacatc atctgtcaag atatgaccca 15420 
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ggcatgctac cagctaccac agactatgcc t:ct:ccagctt tcatgttctc cccaccatga 15480 
tagacttgta tctcctaaaa atggaatcaa agcaaacttt tcctgcatta agtttttttt 1554 0 
tt^ctgttaa gtgtttggtc acagggacaa gaaaacactc aatacagata attagtacca 15600 
gagttgaggt tcattgctct agcaagttgg a^caaat:ttt tagggctttg gaactgattt 15660 
ataagagaca tgtagaagag tctgaagctg tgggctacag aagtgtcacc agtttttaag 1572 0 
aatagtttaa tacaccatgg gaattgtgaa aatcagaatg ctcacacaaa ggcagacagg 15780 
aaaacgtgag catgtggcgt gtgagagggc ataagaagga acctaggggg aaatgagcta 1584 0 
gaagccattc ggctacgtta gggaacgtgt gtggctgtgc ttggcccatg ccctggcaat 15900 
ctgaatgagg ccaaatttta aaggagtgga ctaactcgat tgtcagagaa aatatcaaga 15960 
cagaccacca ctcaggctat gccgtgtttg t:gaccgacca gctactctta gccagctcta 16020 
ttgtgaaatt ccagagcaat tatcagagca tgaagataca tacagtttag tgaagtaagg 16080 
ggtgtgggtc cctaagtgga tggtgcataa atctatgtag gtgatgccta agtgacactt 1614 0 
gataatccaa aatatcagca atgtggaatg tct^ccaagg agacctgtag acacacattt 16200 
tagaactttg ctcatggctg taataaatag ctagctagaa atcatttcct gaagaggtta 16260 
gtctgagtta cggttccagg gcaaacattc agtgatggca aggaaggcat tgcagtcagg 16320 
agccaaaggt cagctggtca cattgcatca agagtagaga gtcagagtgt gagtagaaag 16380 
aggatacagg ttataaaacc tcactgtcca ctctcagcaa tccattttct cctaaaaggc 1644 0 
tttaccttct aaagattt^ta gtcttcaaaa ccagtaccag tagcctggga acaaaagttg 16500 
aaacaaatga gcctttgtgg ggcatttcac acttaaaaca gggcatcacc taggaggagc 16560 
cctgtgtgca gtaggaagtg tggcctctgt gtcaggaatg ctcaggctaa taaggggtcc 1662 0 
tctatctgag ggaccct:atg aagattcaac aag^ag^tgt gagaattccc tgtiaaatgga 16680 
tgctaccaat ttgacatttg tagacctgct attgtgtgct tctttattgg gctctcccat 1674 0 
ctcccaactt tccaacccat attccacatt: aatccc^tcc accaccatgc aacactaggt 16800 
aggagagaag gaaggttaga agagaaagtg ggtatagatc tat:ttagact acttcctgct 16860 
gattaggggc aagtccaatc gtcattgtca ggat^acctcc aaccagcaac cagcaaacca 1692 0 
gcaaatcaga aacagcaaaa gcagccaaca aggcagcact aaccagcagg attggggtcg 16980 
gtagcgtggg agcagtcact actggtcttc tcatggcttt ggcattaata ctctctcaag 17040 
aaattccgta attttttccc caccacctga aattccgtaa ttttaaatgc aaactatcta 17100 
cagctggcaa aaatcacatc tctcctagag cacaagacaa atcatagtta ctggctattt 17160 
gcaatctgaa gcatctcaat atcccacacc tgggattaaa acaaaaacat attcacatca 17220 
cataactgtt ttttttttcc aattttttat taggtatttt ctttatttac atttcaaatg 17280 
ctatcccgaa agtcccctat accctcccac ctccctgctc ccctacacac ccactcccac 17340 
tttttgaccc tggagttccc cggtactggg gcatataaag tttgcaagac caaggggcct 17400 
ctcttcccag tgatggccga ctaagccatc ttctgctaca tatgcagata gagacacgag 17460 
ctctgggggt actagttagt tcatattgtt: gttccaccta tagggtcgca gaccccttca 17520 
gctccttggg tactttgtct agctcctcca ctgggggctc tgtgttttat ctaatagatg 17580 
actgtgagca tccacttctg tatttgacag gcactggcct agcgtcacat gagccagcta 1764 0 
tatcagggtc c^ttcagcaa aaccttgctg gcatgtgcaa tagtgtctgc gtttggtggt 17700 
tgattatggg atggatccac tagttctaga gcggccgcca ccgcggtgga gctccagctt 17760 
ttgttccctt tagtgagggt taattgcgcg cttggcgtaa tcatggtcat agctgtttcc 1782 0 
tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 17880 
taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 1794 0 
cgcttticcag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 18000 
gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact: cgctgcgctc 18060 
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 18120 
agaatcaggg gataacgcag gaaagaaca^ gtgagcaaaa ggccagcaaa aggccaggaa 18180 
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 1824 0 
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 18300 
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gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 18360 
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 18420 
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 18480 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 1854 0 
cttatcgcca ctggcagcag ccactiggtaa caggattagc agagcgaggt atgtaggcgg 18600 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 18660 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 18720 
caaacaaacc accgctggta gcggt:ggt:tt ttttgtttgc aagcagcaga ttacgcgcag 18760 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 18840 
cgaaaactca cgttaaggga tcttggtcat gagattatca aaaaggatct tcacctagat 18900 
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 18960 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 19020 
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 19080 
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 19140 
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 192 00 
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 19260 
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 19320 
ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 193 80 
aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 19440 
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 19500 
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 19560 
gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 19620 
agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 19680 
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 1974 0 
caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 19800 
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 19860 
tcagggttat tgtctcatga gcggat:acat atttgaatgt atttagaaaa ataaacaaat 1992 0 
aggggttccg cgcacatttc cccgaaaagt gccacctaaa ttgtaagcgt tiaatattttg 19980 
ttaaaattcg cgttaaattt ttgttaaatc agctcatttt ttaaccaata ggccgaaatc 2 0 040 
ggcaaaatcc cttataaatc aaaagaatag accgagatag ggttgagtgt tgttccagtt 2 0100 
tggaacaaga gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc 20160 
tatcagggcg atggcccact acgtgaacca tcaccctaat caagtttttt ggggtcgagg 2 022 0 
tgccgtaaag cactaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga 20280 
aagccggcga acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg 2034 0 
ctggcaagtg tagcggtcac gctgcgcgta accaccacac ccgccgcgct taatgcgccg 2 0400 
ctacagggcg cgtcccattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 20460 
cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 20520 
tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgagcgcgcg 2 0580 
taatacgact cactataggg cgaattgggt accgggcccc ccc 2 0623 
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intron plasmid with pBLCAT3 vector 
<400> 2 

ggatcccctt tgctatgtag tttttaatgg aaattacaac ccatagtgtg ttgataaata 60 
gagagtcctg tttggtttaa gcaacctctg tttctcataa actccataaa aacaggaata 120 
ctctttgttt ctagcataac caaaagattt agtgaattga aaacaatgtt cccttagagt 180 
ataggtctaa taaccccgaa aatattacca tgatactgag catttgtaag tatctcatag 240 
catgtagtat ccatagtcca tcaatgagag agacatttaa catgattttc attaatcagg 3 00 
tggaaaagac atgacaacat tcacaggcac tgcacagaac atagtggtcc accttgcaca 360 
tatttcacta aactaggttt atctattttg ttgctttctc taacatctct gcaatgaagc 420 
aggtcaacag tgccacatat cctttactta acctaaggaa cacaaaaaat tttctacata 4 80 
tatcctggtt agagagtgct taaaataagt tttccaagaa tggaaaagaa atgttctgac 540 
ttaacaatta agacagtatt tatttaaagc aagaaatatg aggcacacaa gaaaatattt 600 
tgggaagaaa ccatttggtg aacaatattt caaataaaaa tagacaaaca tagttaattg 660 
taaaacatat gtttgaccag cccttctttt caataggctt aatgtgaata aaatgttaaa 72 0 
gattctcttt gggtggctgc aaattgtcca cgaataagac aaaatataaa aataaggact 7 80 
gagtctcaca aaatgaaaag gaaatatatt cagaaagaga atcttgagag aatgtgttgt 84 0 
cacaaattaa agaaaacctg tggtgaatga catcctgagg cctgagctat tactgacatt 900 
taagataaag gtaactgtat acatttgtcc cattgagggg acaagaaagc tgctctcatg 960 
ttcagctcta taattcttgc cttaaacaac ttaaatagaa tgatttaaaa tatggagctg 102 0 
tccatggacc tttgaaatat aaaatagtca agcaacttat caaggaatta cagattcctt 1080 
gatactaaca caggtaaatc ccacacgtgt tttgagacta catttgctgg gattttattg 1140 
atgtaatagg tcacatgttt ttcgggccaa tgttgctgtt attcggttac ttcaagagaa 1200 
tagtggcaac tgatgctatg tattctaggg gtttgaagtg atgtttcatg attgaaattt 1260 
gtaaaagaat aacatcatca ttcttaacaa tagaacatat aaagtcacac agaagtgaca 1320 
gtgtttaagc tgtactattg atcaaagaaa tttattacct tcagtttcaa tggaaataat 13 80 
tactgataat acaaacatgt gtgaacacac actaatccta tccaaatgca cagtgataca 1440 
cagaaaatat tagcaagtag aatgcaatat ttatataacg attgtattta tcaatcaatt 1500 
gtatgtatca atatatgggc tattttctta cacatgattt tattcaaatt tactctaatc 1560 
attgttgaac catttagaaa aggcatactg gcaacttttc cttacctcat ccagctgggc 1620 
aaaagtccca gtgtggagta aaggatgcaa gatttcctgc tctgttaagt ataaaataat 1680 
agtatgaatt caaaggtgcc attcttctgc ttctagttat aaaggcagtg cttgcttctt 1740 
ccagcacaga tctggatctc gaggagcttg gcgagatttt caggagctaa ggaagctaaa 1800 
agccgccacc atgaaagcca tcttaatccc atttttatct cttctgattc cgttaacccc 1860 
gcaatctgca ttcgctcaga gtgagccgga gctgaagctg gaaagtgtgg tgattgtcag 1920 
tcgtcatggt gtgcgtgctc caaccaaggc cacgcaactg atgcaggatg tcaccccaga 1980 
cgcatggcca acctggccgg taaaactggg ttggctgaca ccgcgcggtg gtgagctaat 2 04 0 
cgcctatctc ggacattacc aacgccagcg tctggtagcc gacggattgc tggcgaaaaa 2100 
gggctgcccg cagtctggtc aggtcgcgat tattgctgat gtcgacgagc gtacccgtaa 2160 
aacaggcgaa gccttcgccg ccgggctggc acctgactgt gcaataaccg tacataccca 2220 
ggcagatacg tccagtcccg atccgttatt taatcctcta aaaactggcg tttgccaact 2280 
ggataacgcg aacgtgactg acgcgatcct cagcagggca ggagggtcaa ttgctgactt 234 0 
taccgggcat cggcaaacgg cgtttcgcga actggaacgg gtgcttaatt ttccgcaatc 2400 
aaacttgtgc cttaaacgtg agaaacagga cgaaagctgt tcattaacgc aggcattacc 2460 
atcggaactc aaggtgagcg ccgacaatgt ctcattaacc ggtgcggtaa gcctcgcatc 252 0 
aatgctgacg gagatatttc tcctgcaaca agcacaggga atgccggagc cggggtgggg 2580 
aaggatcacc gattcacacc agtggaacac cttgctaagt ttgcataacg cgcaatttta 2640 
tttgctacaa cgcacgccag aggttgcccg cagccgcgcc accccgttat tagatttgat 2 700 



00e4247A1_l_> 



wo 00/64247 



PCT/CAOO/00430 



caagacagcg ttgacgcccc atccaccgca 
ttcagtgctg tttatcgccg gacacgatac 
gctcaactgg acgcttcccg gtcagccgga 
tgaacgctgg cgtcggctaa gcgataacag 
gactttacag cagatgcgtg ataaaacgcc 
gaaactgacc ctggcaggat gtgaagagcg 
ttttacgcaa atcgtgaatg aagcacgcat 
gttattggtg cccttaaacg cctggtgcta 
tggcagaaat tcgccggatc tttgtgaagg 
acaaactacc tacagagatt taaagctcta 
gtgttaaact actgattcta attgtttgtg 
aatgggagca gtggtggaat gcctttaatg 
catctagtga tgatgaggct actgctgact 
gaaaggtaga agaccccaag gactttcctt 
tgtttagtaa tagaactctt gcttgctttg 
tgctatacaa gaaaattatg gaaaaatatt 
ataatcataa catactgttt tttcttactc 
actatgctca aaaattgtgt acctttagct 
atttgatgta tagtgccttg actagagatc 
ttacttgctt taaaaaacct cccacacctc 
attgttgttg ttaacttgtt tattgcagct 
acaaatttca caaataaagc atttttttca 
atcaatgtat cttatcatgt ctggatcgat 
tggtcatagc tgtttcctgt gtgaaattgt 
gccggaagca taaagtgtaa agcctggggt 
gcgttgcgct cactgcccgc tttccagtcg 
atcggccaac gcgcggggag aggcggtttg 
actgactcgc tgcgctcggt cgttcggctg 
gtaatacggt tatccacaga atcaggggat 
cagcaaaagg ccaggaaccg taaaaaggcc 
ccccctgacg agcaticacaa aaatcgacgc 
ctataaagat accaggcgtt tccccctgga 
ctgccgctta ccggatacct gtccgccttt 
tgctcacgct gtaggtatct cagttcggtg 
cacgaacccc ccgttcagcc cgaccgctgc 
aacccggtaa gacacgactt atcgccactg 
gcgaggtatg taggcggtgc tacagagttc 
agaaggacag tatttggtat ctgcgctctg 
ggtagctictt gatccggcaa acaaaccacc 
cagcagatta cgcgcagaaa aaaaggatct 
tctgacgctc agtggaacga aaactcacgt 
aggatcttca cctagatcct tttaaattaa 
tatgagtaaa cttggtctga cagttaccaa 
atctgtctat ttcgttcatc catagttgcc 
cgggagggct taccatctgg ccccagtgct 
gctccagatt tatcagcaat aaaccagcca 
gcaactttat ccgcctccat ccagtctatt 
tcgccagtta atagtttgcg caacgttgtt 



aaaacaggcg tatggtgt^ga cattacccac 2 760 
taatctggca aatctcggcg gcgcactgga 2820 
taacacgccg ccaggtggtg aactggtgtt 2880 
ccagtggatt caggtttcgc tggtcttcca 2940 
gctgtcatta aatacgccgc ccggagaggt 3000 
aaatgcgcag ggcatgtgtt cgttggcagg 3060 
acccgcttgc agtttgtaag gtataaggca 3120 
cgcctgaata agtgataata agcggatgaa 3180 
aaccttactt ctgtggtgtg acataattgg 3 240 
aggtaaatat aaaatttttia agtgtat.aat 3300 
tattttagat tccaacctat ggaactgatg 3360 
aggaaaacct gttttgctca gaagaaatgc 342 0 
ctcaacattc tactcctcca aaaaagaaga 3480 
cagaattgct aagttttttg agtcatgctg 3540 
ctatttacac cacaaaggaa aaagctgcac 3 600 
ctgtaacctt tataagtagg cataacagtt 3660 
cacacaggca tagagtgtct gctattaata 3720 
ttttaatttg taaaggggtt aataaggaat 3780 
ataatcagcc ataccacatit tgtagaggtt 384 0 
cccctgaacc tgaaacataa aatgaatgca 3900 
tataatggtt acaaataaag caatagcatc 3 960 
ctgcattcta gttgtggttt gtccaaactc 4020 
ccccgggtac cgagctcgaa ttcgtaatca 4080 
t:atccgctca caattccaca caacatacga 4140 
gcct:aat:gag tgagctaact cacattaatt 4200 
ggaaacctgt cgtgccagct gcattaatga 4260 
cgtattgggc gctcttccgc ttcctcgctc 4320 
C99C9^9c^99 tat cage tea ctcaaaggcg 4380 
aacgcaggaa agaacatgtg agcaaaaggc 444 0 
gcgttgctgg cgtttttcca taggctccgc 4 500 
tcaagtcaga ggtggcgaaa cccgacagga 4560 
agctccctcg tgcgctctcc tgttccgacc 4620 
ctcccttcgg gaagcgtggc gctttctcaa 4680 
taggtcgttc gctccaagct gggctgtgtg 4740 
gccttatccg gtaactatcg tcttgagtcc 4 8 00 
gcagcagcca ctggtaacag gattagcaga 4860 
ttgaagtggt ggcctaacta cggctacact 4920 
ctgaagccag ttaccttcgg aaaaagagtt 4 980 
gctggtagcg gtggtttttt tgtttgcaag 5040 
caagaagatc ctttgatctt ttctacgggg 5100 
taagggattt tggtcatgag attatcaaaa 5160 
aaatgaagtt ttaaatcaat ctaaagtata 5220 
tgcttaatca gtgaggcacc tatctcagcg 5280 
tgactccccg tcgtgtagat aactacgata 5340 
gcaatgatac cgcgagaccc acgctcaccg 5400 
gccggaaggg ccgagcgcag aagtggtcct 54 60 
aattgttgcc gggaagctag agtaagtagt 5520 
gccattgcta caggcatcgt ggtgtcacgc 5580 
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tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga 5640 
tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt 5700 
aagttggccg cagtgttatc actcatggtt atggcagcac tigcataattc tcttactgtc 5760 
atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa 5820 
tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca 5 880 
catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca 5940 
aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct 6000 
tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc 6060 
gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa 6120 
tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt 6180 
tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc 6240 
taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac gaggcccttt 63 00 
cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct cccggagacg 6360 
gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg 6420 
ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat cagagcagat tgtactgaga 64 8 0 
gtgcaccata tgcggtgtga aataccgcac agacgcgtaa ggagaaaata ccgcatcagg 654 0 
cgccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg 6600 
ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca 6660 
gggttttccc agtcacgacg ttg^aaaacg acggccagtg ccaagctt 6708 



<210> 3 
<211> 4060 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: R15/APPA + 
intron transgene 

<400> 3 

ggatcccctt tgctatgtag tttttaatgg aaattacaac ccatagtgtg ttgataaata 60 
gagagtcctg tttggtttaa gcaacctctg tttctcataa actccataaa aacaggaata 120 
ctctttgttt ctagcataac caaaagattt agtgaattga aaacaatgtt cccttagagt 180 
ataggtctaa taaccccgaa aatattacca tgatactgag catttgtaag tatctcatag 240 
catgtagtat ccatagtcca tcaatgagag agacatttaa catgattttc attaatcagg 300 
tggaaaagac atgacaacat tcacaggcac tgcacagaac atagtggtcc accttgcaca 3 60 
tatttcacta aactaggttt atctattttg ttgctttctc taacatctct gcaatgaagc 420 
aggtcaacag tgccacatat cctttactta acctaaggaa cacaaaaaat tttctacata 480 
tatcctggtt agagagtgct taaaataagt tttccaagaa tggaaaagaa atgttctgac 54 0 
ttaacaatta agacagtatt tatttaaagc aagaaatatg aggcacacaa gaaaatattt 600 
tgggaagaaa ccatttggtg aacaatattt caaataaaaa tagacaaaca tagttaattg 660 
taaaacatat gtttgaccag cccttctttt caataggctt aatgtgaata aaatgttaaa 720 
gattctcttt gggtggctgc aaattgtcca cgaataagac aaaatataaa aataaggact 780 
gagtctcaca aaatgaaaag gaaatatatt cagaaagaga atcttgagag aatgtgttgt 84 0 
cacaaattaa agaaaacctg tggtgaatga catcctgagg cctgagctat tactgacatt 900 
taagataaag gtaactgtat acatttgtcc cattgagggg acaagaaagc tgctctcatg 960 
ttcagctcta taattcttgc cttaaacaac ttaaatagaa tgatttaaaa tatggagctg 1020 
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tccatggacc tttgaaatat aaaatagtca agcaacttat caaggaatta cagattcctt 108 0 
gat:actaaca caggtaaatc ccacacgtgt: t:ttgagacta catttgctgg gattttattg 1140 
atgtaatagg tcacatgttt ttcgggccaa tgttgctgtt attcggttac ttcaagagaa 1200 
tagtggcaac tgatgctatg tattctaggg gtttgaagtg atgtttcatg attgaaattt 1260 
gtaaaagaat aacatcatca ttcttaacaa tagaacatat aaagtcacac agaagtgaca 1320 
gtgtttaagc tgtactattg atcaaagaaa tttattacct tcagtttcaa tggaaataat 1380 
tactgataat acaaacatgt gtgaacacac actaatccta tccaaatgca cagtgatiaca 1440 
cagaaaatat tagcaagtag aatgcaatat Ctatataacg attgtattta tcaatcaatt 1500 
gtatgtatca atatatgggc tattttctta cacatgattt tattcaaatt tactctaatc 1560 
attgttgaac catttagaaa aggcatactg gcaacttttc cttacctcat ccagctgggc 1620 
aaaagtccca gtgtggagta aaggatgcaa gatttcctgc tctgttaagt ataaaatiaat 1680 
agtatgaatt caaaggtgcc attcttctgc ttctagttat aaaggcagtg cttgcttctt 1740 
ccagcacaga tctggatctc gaggagcttg gcgagatttt caggagctaa ggaagct:aaa 1800 
agccgccacc atgaaagcca tcttaatccc atttttatct cttctgattc cgttaacccc 1860 
gcaatctgca ttcgctcaga gtgagccgga gctgaagctg gaaagtgtgg tgattgtcag 192 0 
ticgtcatggt gt:gcgtgctc caaccaaggc cacgcaactg atgcaggatg tcaccccaga 1980 
cgcatggcca acctggccgg taaaactggg ttggctgaca ccgcgcggtg gtgagc^aat 2040 
cgcctatctc ggacattacc aacgccagcg tctggtagcc gacggattgc tggcgaaaaa 2100 
gggctgcccg cagtctggtc aggtcgcgat tattgctgat gtcgacgagc gtacccgtaa 2160 
aacaggcgaa gccttcgccg ccgggctggc acctgactgt gcaataaccg tacataccca 2220 
ggcagatacg tccagtcccg atccgttatt taatcctcta aaaactggcg tttgccaact 22 80 
ggataacgcg aacgtgactg acgcgatcct cagcagggca ggagggtcaa ttgctgactt 2340 
taccgggcat cggcaaacgg cgtttcgcga actggaacgg gtgcttaatt ttccgcaatc 2400 
aaacttgtgc cttaaacgtg agaaacagga cgaaagctgt tcattaacgc aggcattacc 2460 
atcggaactc aaggtgagcg ccgacaatgt ct:cat:t:aacc ggtgcggtaa gcct:cgcatc 2520 
aatgctgacg gagatatttc tcctgcaaca agcacaggga atgccggagc cggggtgggg 25 80 
aaggatcacc gattcacacc agtggaacac cttgc^aagt ttgcataacg cgcaatt^l^a 2640 
^t:tgctacaa cgcacgccag aggttgcccg cagccgcgcc accccgttat tagatttgat 2700 
caagacagcg ttgacgcccc atccaccgca aaaacaggcg tatggtgtga cattacccac 2760 
ttcagtgctg tttatcgccg gacacgatac taatctggca aatctcggcg gcgcactgga 2820 
gctcaactgg acgcttcccg gtcagccgga taacacgccg ccaggtggt:g aactggtgtt: 2880 
tgaacgctgg cgtcggctaa gcgataacag ccagtggatt caggtttcgc tggtcttcca 294 0 
gactttacag cagatgcgtg ataaaacgcc gctgtcatta aatacgccgc ccggagaggt 3000 
gaaactgacc ctggcaggat gtgaagagcg aaatgcgcag ggcatgtgtt cgttggcagg 3060 
t:t:ttacgcaa atcgtgaatg aagcacgcat acccgcttgc agtttgtaag gtat:aaggca 3120 
gttattggtg cccttaaacg cctggtgcta cgcctgaata agtgataata agcggatgaa 3180 
tggcagaaat tcgccggatc tttgtgaagg aaccttactt ctgtggtgtg acataattgg 3240 
acaaactacc tacagagatt taaagctcta aggtaaatat aaaattttta agtgtataat 33 00 
gtg^taaact actgattc^a attgtttgtg tattttiagat tccaacctat ggaactgatg 3360 
aatgggagca gtggtggaat gcctttaatg aggaaaacct gttttgctca gaagaaatgc 3420 
catctagtga tgatgaggct actgctgact ctcaacattc tactcctcca aaaaagaaga 3480 
gaaaggtaga agaccccaag gactttcctt cagaatitgct aagttttttg agtcatgctig 3540 
tgtttagtaa tagaactctt gcttgctttg ctatttacac cacaaaggaa aaagctgcac 3 600 
tigctatacaa gaaaattatg gaaaaatatt ctgtaacctt tataagtagg cataacagtt 3660 
ataatcataa catactgttt tttcttactc cacacaggca tagagtgtct gctatt:aata 3720 
actatgctca aaaattgtgt acctttagct ttttaatttg taaaggggtt aataaggaat 3780 
atttgatgta tagtgccttg actagagatc ataat:cagcc ataccacatt tgtiagaggtt 384 0 
ttacttgctt tiaaaaaacct cccacacctc cccct:gaacc tgaaacataa aatgaatgca 3900 
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attgttgttg ttaacttgtt tattgcagct tataatggtt acaaataaag caatagcatc 3 960 
acaaats^tca caaataaagc atttt^ttca ctgcattcta gt:^gt:ggttt gtccaaactc 4020 
atcaatgtat cttatcatgt ctggatcgat ccccgggtac 4060 



<210> 4 
<211> 6116 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: R15/APPA 
plasmid with pBLCAT3 vector 

<400> 4 

ggatcccctt tgctatgtag tttttaatgg aaattacaac ccatagtgtg ttgataaata 60 
gagagtcctg tttggtttaa gcaacctctg tttctcataa actccataaa aacaggaata 120 
ctctttgttt ctagcataac caaaagattt agtgaattga aaacaatgtt cccttagagt 180 
ataggtctaa taaccccgaa aatattacca tgatactgag catttgtaag tatctcatag 240 
catgtagtat ccatagtcca tcaatgagag agacatttaa catgattttc attaatcagg 3 00 
tggaaaagac atgacaacat tcacaggcac tgcacagaac atagtggtcc accttgcaca 360 
tatttcacta aactaggttt atctattttg ttgctttctc taacatctct gcaatgaagc 420 
aggtcaacag tgccacatat cctttactta acctaaggaa cacaaaaaat tttctacata 4 80 
tatcctggtt agagagtgct taaaataagt tttccaagaa tggaaaagaa atgttctgac 54 0 
ttaacaatta agacagtatt tatttaaagc aagaaatatg aggcacacaa gaaaatattt 600 
tgggaagaaa ccatttggtg aacaatattt caaataaaaa tagacaaaca tagttaattg 660 
taaaacatat gtttgaccag cccttctttt caataggctt aatgtgaata aaatgttaaa 72 0 
gattctcttt gggtggctgc aaattgtcca cgaataagac aaaatataaa aataaggact 780 
gagtctcaca aaatgaaaag gaaatatatt cagaaagaga atcttgagag aatgtgttgt 840 
cacaaattaa agaaaacctg tggtgaatga catcctgagg cctgagctat tactgacatt 900 
taagataaag gtaactgtat acatttgtcc cattgagggg acaagaaagc tgctctcatg 960 
ttcagctcta taattcttgc cttaaacaac ttaaatagaa tgatttaaaa tatggagctg 1020 
tccatggacc tttgaaatat aaaatagtca agcaacttat caaggaatta cagattcctt 1080 
gatactaaca caggtaaatc ccacacgtgt tttgagacta catttgctgg gattttattg 114 0 
atgtaatagg tcacatgttt ttcgggccaa tgttgctgtt attcggttac ttcaagagaa 1200 
tagtggcaac tgatgctatg tattctaggg gtttgaagtg atgtttcatg attgaaattt 1260 
gtaaaagaat aacatcatca ttcttaacaa tagaacatat aaagtcacac agaagtgaca 132 0 
gtgtttaagc tgtactattg atcaaagaaa tttattacct tcagtttcaa tggaaataat 13 80 
tactgataat acaaacatgt gtgaacacac actaatccta tccaaatgca cagtgataca 144 0 
cagaaaatat tagcaagtag aatgcaatat ttatataacg attgtattta tcaatcaatt 1500 
gtatgtatca atatatgggc tattttctta cacatgattt tattcaaatt tactctaatc 1560 
attgttgaac catttagaaa aggcatactg gcaacttttc cttacctcat ccagctgggc 162 0 
aaaagtccca gtgtggagta aaggatgcaa gatttcctgc tctgttaagt ataaaataat 1680 
agtatgaatt caaaggtgcc attcttctgc ttctagttat aaaggcagtg cttgcttctt 174 0 
ccagcacaga tctggatctc gaggagcttg gcgagatttt caggagctaa ggaagctaaa 1800 
agccgccacc atgaaagcca tcttaatccc atttttatct cttctgattc cgttaacccc 1860 
gcaatctgca ttcgctcaga gtgagccgga gctgaagctg gaaagtgtgg tgattgtcag 1920 
tcgtcatggt gtgcgtgctc caaccaaggc cacgcaactg atgcaggatg tcaccccaga 1980 
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cgcatggcca acctggccgg taaaacCggg ttggctgaca ccgcgcggtg gtgagctaat 2040 
cgcctatctc ggacattacc aacgccagcg tctggtagcc gacggattgc tggcgaaaaa 2100 
gggctigcccg cagtctggtc aggtcgcgat tattgctgat gtcgacgagc gtacccgtaa 2160 
aacaggcgaa gccttcgccg ccgggctggc acctgactgt gcaataaccg tacataccca 2220 
ggcagatacg tccagtcccg atccgt^att taatcctcta aaaactggcg tttgccaact 2280 
ggataacgcg aacgtgactg acgcgat:cct cagcagggca ggagggtcaa ttgctgactt 2340 
taccgggcat cggcaaacgg cgtttcgcga actggaacgg gtgcttaatt ttccgcaatc 2400 
aaacttgtgc cttaaacgtg agaaacagga cgaaagctgt tcattaacgc aggcattacc 2460 
atcggaactc aaggtgagcg ccgacaatgt ctcattaacc ggtgcggtaa gcctcgcatc 252 0 
aatgctgacg gagatatttc tcctgcaaca agcacaggga atgccggagc cggggtgggg 2580 
aaggatcacc gattcacacc agtggaacac cttgctaagt ttgcataacg cgcaatttta 2640 
tttgctacaa cgcacgccag aggttgcccg cagccgcgcc accccgttat tagatttgat 2700 
caagacagcg ttgacgcccc atccaccgca aaaacaggcg tatggtgtga cattacccac 2760 
ttcagtgctg tttatcgccg gacacga^ac taatctggca aatctcggcg gcgcactgga 2 82 0 
gctcaactgg acgcttcccg gtcagccgga taacacgccg ccaggtggtg aactggtgtt 2880 
tgaacgctgg cgtcggctaa gcgataacag ccagtggatt caggtttcgc tggtcttcca 294 0 
gactttacag cagatgcgtg ataaaacgcc gctgtcatta aatacgccgc ccggagaggt 3000 
gaaactgacc ctggcaggat gtgaagagcg aaatgcgcag ggcatgtgtt cgttggcagg 3 060 
ttttacgcaa atcgtgaatg aagcacgcat acccgcttgc agtttgtaag gtataaggca 3120 
gttattggtg cccttaaacg cctggtgcta cgcctgaata agtgataata agcggatgaa 318 0 
tggcagaaat tcgccggatc tttgtgaagg aaccttactt ctgtggtgtg acataattgg 3240 
acaaactacc tacagagatt taaaaaacct cccacacctc cccctgaacc tgaaacataa 3300 
aatgaatgca attgttgttg ttaacttgtt tattgcagct tataatggtt acaaataaag 3 360 
caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt 342 0 
gtccaaactc atcaatgtat cttatcatgt ctggatcgat ccccgggtac cgagctcgaa 3480 
ttcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgctca caattccaca 354 0 
caacatacga gccggaagca taaagtgt:aa agcctggggt gcctaatgag tgagctaact 3 600 
cacattaatt gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct 3660 
gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc gctcttccgc 372 0 
ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tat cage tea 3780 
ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg 3 840 
agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca 3 900 
taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa 3960 
cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc 4020 
tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc 4 080 
gctttctcaa tgctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct 4140 
gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg 4200 
tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag 4260 
gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta 432 0 
cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg 4 3 80 
aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt 444 0 
tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt 4500 
ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag 4 56 0 
attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat 4 620 
ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 4680 
tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat 4 74 0 
aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc 4800 
acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag 4 860 
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aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 492 0 
agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt 4 98 0 
ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg 5040 
ag^tacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt 5100 
tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc 5160 
tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc 5220 
atltctgagaa tagtgtatgc ggcgaccgag ttgctcttzgc ccggcgtcaa tacgggataa 5280 
taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg 534 0 
aaaactctca aggatcttac cgctgtCgag atccagttcg atgtaaccca ctcgtgcacc 5400 
caactgatct t:cagcatctt ttactrttcac cagcgtttct gggtgagcaa aaacaggaag 5460 
gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt 552 0 
cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt 5580 
tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc 5640 
acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac 5700 
gaggcccttt cgtctcgcgc gtttcggtiga tgacggt.gaa aacctctgac aca^gcagct 5760 
cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 5820 
cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat cagagcagat 5880 
tg^actgaga gtgcaccata tgcggtgtga aataccgcac agatgcgtaa ggagaaaata 5940 
ccgcatcagg cgccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg 6000 
ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg 6060 
ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagc 6116 



<210> 5 

<211> 3470 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: R15/APPA 
transgene 

<400> 5 

ggatcccctt tgctatgtag tttttaatgg aaattacaac ccatagtgtg ttgataaata 60 
gagagtcctg tttggtttaa gcaacctctg tttctcataa actccataaa aacaggaata 120 
ctctttgttt ctagcataac caaaagattt agtgaattga aaacaatgtt cccttagagt 180 
ataggtctaa taaccccgaa aatattacca tgatactgag catttgtaag tatctcatag 24 0 
catgtagtat ccatagtcca tcaatgagag agacatttaa catgattttc attaatcagg 300 
tggaaaagac atgacaacat tcacaggcac tgcacagaac atagtggtcc accttgcaca 360 
tatttcacta aactaggttt atctattttg ttgctttctc taacatctct gcaatgaagc 420 
aggtcaacag tgccacatat cctttactta acctaaggaa cacaaaaaat tttctacata 4 80 
tatcctggtt agagagtgct taaaataagt tttccaagaa tggaaaagaa atgttctgac 540 
ttaacaatta agacagtatt tatttaaagc aagaaatatg aggcacacaa gaaaatattt 600 
t^999e^^9^6La ccatttggtg aacaatattt caaataaaaa tagacaaaca tagttaattg 660 
taaaacatat gtttgaccag cccttctttt caataggctt aatgtgaata aaatgttaaa 720 
gattctcttt gggtggctgc aaattgtcca cgaataagac aaaatataaa aataaggact 780 
gagtctcaca aaatgaaaag gaaatatatt cagaaagaga atcttgagag aatgtgttgt 840 
cacaaattaa agaaaacctg tggtgaatga catcctgagg cctgagctat tactgacatt 900 
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taagataaag gtaactgtat acatttgtcc cattgagggg acaagaaagc tgctctcatg 960 
ttcagctcta taattcttgc cttaaacaac ttiaaatagaa tgattitaaaa tatggagctg 1020 
tccatggacc tttgaaatat aaaatagtca agcaacttat caaggaatta cagattcctt 1080 
gatactaaca caggtaaatc ccacacgtgt tttgagacta catttgctgg gattttattg 1140 
atgtaatagg tcacatgttt ttcgggccaa tgttgctgtt attcggttac ttcaagagaa 1200 
tagtggcaac tgatgctatg tattctaggg gtttgaagtg atgtttcatg attgaaattt 1260 
gtaaaagaat aacatcatca ttcttaacaa tagaacatat aaagtcacac agaagtgaca 1320 
gtgtttaagc tgtactattg atcaaagaaa tttattacct tcagtttcaa tggaaataat 1380 
tactgataat acaaacatgt gtgaacacac actaatccta tccaaatgca cagtgataca 144 0 
cagaaaatat tagcaagtag aatgcaatat ttiatataacg attgtattta tcaatcaatt 1500 
gtatgtatca atatatgggc tattttctta cacatgattt tattcaaatt tactctaatc 1560 
attgttgaac catttagaaa aggcatactg gcaacttttc cttacctcat ccagctgggc 1620 
aaaagtccca gtgtggagta aaggatgcaa gatttcctgc tct-gttaagt ataaaataat 1680 
agtatgaatt caaaggtgcc attcttctgc ttctagttat aaaggcagtg cttgcttctt 1740 
ccagcacaga tctggatctc gaggagcttg gcgagatttt caggagctaa ggaagctaaa 1800 
agccgccacc atgaaagcca tcttaatccc atttttatct cttctgattc cgttaacccc 1860 
gcaatctgca ttcgctcaga gtgagccgga gctgaagctg gaaagtgtgg tgattgtcag 1920 
tcgtcatggt gtgcgtgctc caaccaaggc cacgcaactg atgcaggatg tcaccccaga 1980 
cgcatggcca acctggccgg taaaactggg t^ggctgaca ccgcgcggtg gtgagctaat 204 0 
cgcctatctc ggacattacc aacgccagcg tctggtagcc gacggattgc tggcgaaaaa 2100 
gggctgcccg cagtctggtc aggtcgcgat tattgctgat gtcgacgagc gtacccgtaa 2160 
aacaggcgaa gccttcgccg ccgggctggc acctgactgt gcaataaccg tacataccca 222 0 
ggcagatacg tccagtcccg atccgtitatt taatcctcta aaaactggcg tttgccaact 22 80 
ggataacgcg aacgtgactg acgcgatcct cagcagggca ggagggtcaa ttgctgactt 234 0 
taccgggcat cggcaaacgg cgtttcgcga actggaacgg gtgcttaat^ ttccgcaatc 2400 
aaacttgtgc cttaaacgtg agaaacagga cgaaagctgt tcattaacgc aggcattacc 2460 
atcggaactc aaggtgagcg ccgacaatgt ctcattaacc ggtgcggtaa gcctcgcatc 252 0 
aatgctgacg gagatatttc tcctgcaaca agcacaggga atgccggagc cggggtgggg 2580 
aaggatcacc gattcacacc agtggaacac cttgctaagt tt:gcataacg cgcaatttta 264 0 
tttgctacaa cgcacgccag aggttgcccg cagccgcgcc accccgttat tagatttgat 2700 
caagacagcg tt:gacgcccc atccaccgca aaaacaggcg tatggtgtga cattacccac 2760 
ttcagtgctg tttatcgccg gacacgatac taatctggca aatctcggcg gcgcactgga 2820 
gctcaactgg acgcttcccg gtcagccgga taacacgccg ccaggtggtg aactggtgtt 288 0 
tgaacgctgg cgtcggctaa gcgataacag ccagtggatt caggtttcgc tggtcttcca 2940 
gactttacag cagatgcgtg ataaaacgcc gctgtcatta aatacgccgc ccggagaggt 3 000 
gaaactgacc ctggcaggat gtgaagagcg aaatgcgcag ggcatgtgtt cgttggcagg 3 060 
ttttacgcaa atcgtgaatg aagcacgcat acccgcttgc agttitgtaag gtataaggca 3120 
gttattggtg cccttaaacg cctggtgcta cgcctgaata agtgataata agcggatgaa 3180 
tggcagaaat tcgccggatc tttgtgaagg aaccttactt ctgtggtgtg acataattgg 3240 
acaaactacc tacagagatt taaaaaacct cccacacctc cccctgaacc tgaaacataa 33 00 
aatgaatgca attgttgttg ttaacttgtt tattgcagct tataatggtt acaaataaag 3360 
caatagcatc acaaatttca caaataaagc atttttttca ctgcattcta gttgtggttt 342 0 
gtccaaactc aticaatgtat ctitatcatgt ctggatcgat ccccgggtac 3470 



<210> 6 
<211> 5421 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SV40/APPA + 
intron plasmid 

<400> 6 

cgagattttc aggagctaag gaagctaaaa gccgccacca tgaaagccat cttaatccca 60 
tttttatctc ttctgattcc gttaaccccg caatctgcat tcgctcagag tgagccggag 120 
ctgaagctgg aaagtgtggt gattgtcagt cgtcatggtg tgcgtgctcc aaccaaggcc 180 
acgcaactga tgcaggatgt caccccagac gcatggccaa cctggccggt aaaactgggt 240 
tggctgacac cgcgnggtgg tgagctaatc gcctatctcg gacattacca acgccagcgt 3 00 
ctggtagccg acggattgct ggcgaaaaag ggctgcccgc agtctggtca ggtcgcgatt 360 
attgctgatg tcgacgagcg tacccgtaaa acaggcgaag ccttcgccgc cgggctggca 420 
cctgactgtg caataaccgt acatacccag gcagatacgt ccagtcccga tccgttattt 4 80 
aatcctctaa aaactggcgt ttgccaactg gataacgcga acgtgactga cgcgatcctc 54 0 
agcagggcag gagggtcaat tgctgacttt accgggcatc ggcaaacggc gtttcgcgaa 600 
ctggaacggg tgcttaattt tccgcaatca aacttgtgcc ttaaacgtga gaaacaggac 660 
gaaagctgtt cattaacgca ggcattacca tcggaactca aggtgagcgc cgacaatgtc 72 0 
tcattaaccg gtgcggtaag cctcgcatca atgctgacgg agatatttct cctgcaacaa 780 
gcacagggaa tgccggagcc ggggtgggga aggatcaccg attcacacca gtggaacacc 840 
ttgctaagtt tgcataacgc gcaattttat ttgctacaac gcacgccaga ggttgcccgc 900 
agccgcgcca ccccgttatt agatttgatc aagacagcgt tgacgcccca ccaccgcaaa 960 
aacaggcgta tggtgtgaca ttacccactt cagtgctgtt tatcgccgga cacgatacta 1020 
atctggcaaa tctcggcggc gcactggagc tcaactggac gcttcccggt cagccggata 1080 
acacgccgcc aggtggtgaa ctggtgtttg aacgctggcg tcggctaagc gataacagcc 1140 
agtggattca ggtttcgctg gtcttccaga ctttacagca gatgcgtgat aaaacgccgc 1200 
tgtcattaaa tacgccgccc ggagaggtga aactgaccct ggcaggatgt gaagagcgaa 1260 
atgcgcaggg catgtgttcg ttggcaggtt ttacgcaaat cgtgaatgaa gcacgcatac 1320 
ccgcttgcag tttgtaaggc agttattggt gcccttaaac gcctggtgct acgcctgaat 1380 
aagtgataat aagcggatga atggcagaaa ttcgccggat ctttgtgaag gaaccttact 1440 
tctgtggtgt gacataattg gacaaactac ctacagagat ttaaagctct aaggtaaata 1500 
taaaattttt aagtgtataa tgtgttaaac tactgattct aattgtttgt gtattttaga 1560 
ttccaaccta tggaactgat gaatgggagc agtggtggaa tgcctttaat gaggaaaacc 1620 
tgttttgctc agaagaaatg ccatctagtg atgatgaggc tactgctgac tctcaacatt 1680 
ctactcctcc aaaaaagaag agaaaggtag aagaccccaa ggactttcct tcagaattgc 174 0 
taagtttttt gagtcatgct gtgtttagta atagaactct tgcttgcttt gctatttaca 1800 
ccacaaagga aaaagctgca ctgctataca agaaaattat ggaaaaatat tctgtaacct 1860 
ttataagtag gcataacagt tataatcata acatactgtt ttttcttact ccacacaggc 192 0 
atagagtgtc tgctattaat aactatgctc aaaaattgtg tacctttagc tttttaattt 1980 
gtaaaggggt taataaggaa tatttgatgt atagtgcctt gactagagat cataatcagc 2 040 
cataccacat ttgtagaggt tttacttgct ttaaaaaacc tcccacacct ccccctgaac 2100 
ctgaaacata aaatgaatgc aattgttgtt gttaacttgt ttattgcagc ttataatggt 2160 
tacaaataaa gcaatagcat cacaaatttc acaaataaag catttttttc actgcattct 2220 
agttgtggtt tgtccaaact catcaatgta tcttatcatg tctggatcga tccccgggta 2280 
ccgagctcga attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc 234 0 
acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga 2400 
gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg 24 60 
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tcgtgccagc tgcattaatg aatcggccaa 
cgctcttccg cttcctcgct cactgactcg 
gtatcagctc actcaaaggc ggtaatacgg 
aagaacatgt gagcaaaagg ccagcaaaag 
gcgtttttcc ataggctccg cccccctgac 
aggtggcgaa acccgacagg actataaaga 
gtgcgctctc ctgttccgac cctgccgctt 
ggaagcgtgg cgctttctca atgctcacgc 
cgctccaagc tgggctgtgt gcacgaaccc 
ggtaactatc gtcttgagtc caacccggta 
actggtaaca ggattagcag agcgaggtat 
tggcctaact acggctacac tagaaggaca 
gttaccttcg gaaaaagagt tggtagctct 
ggtggttttt ttgtttgcaa gcagcagatt 
cctttgatct tttctacggg gtctgacgct 
ttggtcatga gattatcaaa aaggatcttc 
tttaaatcaa tctaaagtat atatgagtaa 
agtgaggcac ctatctcagc gatctgtcta 
gtcgtgtaga taactacgat acgggagggc 
ccgcgagacc cacgctcacc ggctccagat 
gccgagcgca gaagtggtcc tgcaacttta 
cgggaagcta gagtaagtag ttcgccagtt 
acaggcatcg tggtgtcacg ctcgtcgttt 
cgatcaaggc gagttacatg atcccccatg 
cctccgatcg ttgtcagaag taagttggcc 
ctgcataatt ctcttactgt catgccatcc 
tcaaccaagt cattctgaga atagtgtatg 
atacgggata ataccgcgcc acatagcaga 
tcttcggggc gaaaactctc aaggatctta 
actcgtgcac ccaactgatc ttcagcatct 
aaaacaggaa ggcaaaatgc cgcaaaaaag 
ctcatactct tcctttttca atattattga 
ggatacatat ttgaatgtat ttagaaaaat 
cgaaaagtgc cacctgacgt ctaagaaacc 
aggcgtatca cgaggccctt tcgtctcgcg 
cacatgcagc tcccggagac ggtcacagct 
gcccgtcagg gcgcgtcagc gggtgttggc 
tcagagcaga ttgtactgag agtgcaccat: 
aggagaaaat accgcatcag gcgccattcg 
cgatcggtgc gggcctcttc gctattacgc 
cgattaagtt gggtaacgcc agggttttcc 
gccaagcttt acactttatg cttccggctc 
aatttcacac aggaaacagc tatgaccatg 
aaataacctc tgaaagagga acttggttag 
ggaatgtgtg tcagttaggg tgtggaaagt 
aaagcatgca tctcaattag tcagcaacca 
gcagaagtat gcaaagcatg catctcaatt 
cgcccatccc gcccctaact ccgcccagtt 
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cgcgcgggga gaggcggttt gcgtattggg 2520 
ctgcgctcgg tcgttcggct gcggcgagcg 2580 
ttat:ccacag aatcagggga taacgcagga 264 0 
gccaggaacc gtaaaaaggc cgcgttgctg 2700 
gagcatcaca aaaatcgacg ctcaagtcag 2760 
taccaggcgt ttccccctgg aagctccctc 2820 
accggatacc tgtccgcctt tctcccttcg 2880 
tgtaggtatc tcagttcggt gtaggtcgtt 2940 
cccgttcagc ccgaccgctg cgccttatcc 3000 
agacacgact tatcgccact ggcagcagcc 3060 
gtaggcggtg ctacagagtt cttgaagtgg 3120 
gtatttggta tctgcgctct gctgaagcca 3180 
^gatccggca aacaaaccac cgctggtiagc 3240 
acgcgcagaa aaaaaggatc tcaagaagat 3300 
cagtggaacg aaaactcacg ttaagggatt 3 360 
acctagatcc ttttaaatta aaaatgaagt 3420 
acttggtctg acagttacca atgcttaatc 3480 
tttcgttcat ccatagttgc ctgactcccc 3540 
ttaccatctg gccccagtgc tgcaatgata 3600 
ttatcagcaa taaaccagcc agccggaagg 3660 
tccgcctcca tccagtctat taattgttgc 3720 
aatagtttgc gcaacgttgt tgccattgct 3780 
ggtatggctt cattcagctc cggttcccaa 3840 
ttgtgcaaaa aagcggttag ctccttcggt 3900 
gcagtgttat cactcatggt tatggcagca 3960 
gtiaagatgct tttctgtgac tggtgagtac 4020 
cggcgaccga gttgctcttg cccggcgtca 4 080 
actttaaaag tgctcatcat tggaaaacgt 4X4 0 
ccgctgttga gatccagttc gatgtaaccc 4200 
tttactttca ccagcgtttc tgggtgagca 4260 
ggaataaggg cgacacggaa atgttgaata 4320 
agcatttatc agggttattg tctcatgagc 4380 
aaacaaatag gggttccgcg cacatttccc 444 0 
at:t:attatca tgacattaac ctataaaaat 4500 
cgtt^tcggtg atgacggtga aaacctctga 4560 
tgtctgtaag cggatgccgg gagcagacaa 4 620 
gggtgtcggg gctggcttaa ctatgcggca 4680 
atgcggtgtg aaataccgca cagatgcgta 4 740 
ccattcaggc tgcgcaactg ttgggaaggg 4800 
cagctggcga aagggggatg tgctgcaagg 4860 
cagtcacgac gttgtaaaac gacggccagt 4920 
gtatgttgtg tggaattgtg agcggataac 4980 
attacgaatt cggcgcagca ccatggcctg 5040 
gtiaccttctg aggcggaaag aaccagctgt 5100 
ccccaggctc cccagcaggc agaagtatgc 5160 
ggt:g1:ggaaa gtccccaggc tccccagcag 5220 
agtcagcaac catagtcccg cccctaactc 5280 
ccgcccattc tccgccccat ggctgactaa 5340 



BNSDOCID- <WO 0064247A1J_> 



wo 00/64247 



PCT/CAOO/00430 



ttttttttat ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt 5400 
gaggaggctc gaggagcttg g 5421 



<210> 7 
<211> 17732 
<212> DNA 

<213> Artificial Secfuence 
<220> 

<223> Description of Artificial Sequence: Lama2/APPA 
transgene 

<400> 7 

tcgagagtat ctttgtcagc tgtgcctcca acaaaggggt actgttgccc acatagaaag 60 
atctaaacta attaattaat ccctcacccg caaatctttc agtcactaag ttagcacgat 120 
tgttgaacaa gttctccaaa ggagagatac agatgagtgc gtatagggtg gacctggctg 18 0 
ctgaggagac acctgcatct gactaagaag agccacggtg ttagttgaat ggtgtggagt 24 0 
agggtggttc tgtgggacag tagaaaatcg agaggcatgt gccgtttagt gaactgatgg 3 00 
aagctacccc aaacgacaga gattgtcagt caggccaatc cgtttcgagt ttgatgggca 360 
gccggacagt gagacagaca cacctactca gttggaggaa ggatgagaac aatggccagc 420 
agggattgag agaccctgac aggcgcaagg ccctaacaca cacacctacc acctcacttg 4 80 
acaaagctgc caaagaccaa agacttgttc tccattagaa atgacagctg gcttgacccg 54 0 
acagcataat aagcagagtg tactctgatt ggagaacttt aatgtgtttc attcagtatt 600 
ataaaaggac agtattacag attttgttgt acactgctgt tacatgtggg gcagtgtgtc 660 
tttaagtagg gtaaagtact ctttaaaaat gggtcctaga tattttttcc tttaactcaa 720 
gtctcttact gtttaaatga tttttatttt gtttaatatg gaggaaaaag aagcgtaaat 780 
ggacaatata tatttagaga aagatggtta gctgtcagaa aaatatgcaa atcaaaatca 840 
caccaagact gcagcacacc cctgtcagat ggctgtgatc aagaaaataa atgacaatga 900 
gtggtggtga agatgtacta aagggaaaca cacacacaca cacacacaca cacacacaca 960 
cacactggag caaccactgt ggaaatcagt atgaatggtc ctcaaaaacc tgaagataga 1020 
gcggggcgtg gtggcataca cttttattcc cagcactggg gaggcagagg caggtggatc 1080 
tctgagttcc aggccagcct ggtctatagc acaggttcta ggacagccag ggctacacag 1140 
aaaaaccctg ccttgattaa accaaaccaa accaaaccaa accaaaccaa accaaaccaa 1200 
accaaaccaa accaaaccag accaaaccaa aacactgaag atagaacttc agtattccat 1260 
tcctagatat atacccaatg gagactaagt cagcaagaca cctgcacagc catgttcact 1320 
actacactgt tcaccacagc caggctgtgg aaccagcctg agtgtccatg ataaatgaat 1380 
ggataggtaa ctttcaaggt aaatggactc tgctgtgtac atgcctcaca ttctgtttat 1440 
tcatttttct ttatgaggtg tccattcagg agtcacatgg tagttctatt ttcagtcttc 1500 
tgaagatact acactggtcc ccacagttta cacttttatc agcagtgaat aagggttcct 1560 
ctatccttac catcatttgt tgtaattttt cttgatgacc ctctttctga cagggatagg 162 0 
atgtaatatc agtgtgagga agtacaactt gttttctaag tatttattgg ccccttgcat 1680 
ttcttctttt gaaaactgtc ggttcctgac atctgctcag gtattcattg gatgttgttt 1740 
ctttggtgtt tgagttctta tgaattctag atgttaaatc cctgcctgtg gttctctccc 1800 
attctgtagg ctgcctcctc accctggcaa ttgttgtcct tgttttgcag aaacttttga 1860 
cttcatggaa tctcatttgt cagttttccc tcctctgcta tagcctgagc taatgcactg 1920 
gtttttacag agccctggtc tatgccttta tcctcctctg gcagcttcgg agtttcattt 1980 
cttacattta gatctttgat ccactttgaa caagttttgg agcagggtga gagatacgaa 2 04 0 
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tctagttcca ttcttccata tgtgatccta gtttacatag catcgttggt tgaagaggtt 2100 
ttattttatt tttaaataat gtgtcataaa aaacgaggtg gttgtagcag tgtggatttg 2160 
tttctttgtc ctttgatcta caggtcttgt tttgtgtcag tctcatgatg ttttattgct 2220 
atggctctgt catacagtct gaggtcaggt attgtgatat accttcagta ttgctccctc 2280 
agactcaggt ttgctttggc caggagtcat cttactcagt gctcttagag ctcccccagc 234 0 
atgtagctgc tactattctt agttgataaa tcaggaaact ggggctcaga gagattaact 2400 
gt:cttgaact acttctgggg aggtgaaacg tggagacact aaactgtgtt taccctgtac 2460 
tgctccagta gctgtcgggt gctgggctac agcaaagcac ctatactata tattactcag 2520 
gaggtggaaa aactcagcct cccttggggt tcccaagctc ccaggtgtcc agtcactgct 2580 
ggaaacctca tggagtctga aaggaagggt tgagggtaca tggggcagcg atgaggagcc 2640 
tggggctggg atctcccaaa cacctggata tccagatgcc actgggtcag ggggagttgg 2700 
gaacagagtt gggatgtcca tggacctgtg acaaggccag ggccaggggg aggataactc 2760 
tggctttact aatttgcgaa agtccttagc ttagcagcag ttgtctggga gcacagaggg 2820 
gccttctgta agaggctcag gcagtgccgc tctgtaggcg aaggtcttct ccatgttccc 2880 
catggtggtt cttgatgaaa gagacagtcc ttggctccaa actggtttat tgattgttca 2 940 
ttgtggaaaa tgggtgcaca ccaccttctc agggtggacc agagatcaaa taccttttgc 3000 
agggaggaat atctgggaag ggacgcttac tggctaaacc ctcagggcct ctagatacat 3060 
cattagcatg gagaactctg ttctgggcta catgaccaca ggccacattt ccacaagcca 3120 
catgtgggaa gtgtggcaca tgttctaggc caggaatctg gtagggagcg tggagccacc 3180 
taccatccca ggtgggtgcc tgggtgccag ggaccctgaa cccgctcaac cttaccaagt 3240 
ttcctggcag ggtccactgt cctacacaga agctggagga ggtgtgaggg ttgtgtcttt 3300 
gtggaatgtc ccatgctgct tggggctcag tttctccacc tgtacctcat tggtttgggt 3360 
ataaaaagtg gggatacttt attattctct gactcggtcc tgaggaaaaa gcatcgtggc 3420 
agtccaggaa ccacaccctg aggttcctgc actgaaggga ctccctaagt ctctggagtc 3480 
tct:ccccttc acagagctgc caaagtctag gttctt:t:tga ggataacaga gccatgcttg 3540 
gtaagcagac aacagcattt gtttacticaa ccttcttttg tcagctccct cttcataaac 3600 
aagttgagac accatgctgg cttgaggaag acttctaaag ccagacaact gtgcaaggaa 3 660 
gaagaagaag gggcaagtgg agt:t;agcct.g gatgtagccc tcaaagtctc cagagaccag 3720 
ccatgaaggc tcaagtggag ggcaagacct gcagcagcca agcatctggc aggagaggat 3780 
cctgggaacc cctctaccat gacacacatt cttcctgcag gtcacactta ataggccatt 3840 
tcttatttgg atctatcatg gtgttctgtg cgagattaat gaggtgttat gctgcgaaca 3900 
gaaagttata taaaaacaag tccccccccc ttgtcactgc tgctaagaat gtagcagaaa 3960 
ttgtctcaag tgtctctcta atcagaaaca ataaaggtct ccttggattc aagccctcca 4020 
gtttcctcct tccttgctga gccttggaca cccatacaaa cctcctggat gctacagctc 4080 
tgggcagaga ctccaaggtg gggagagact gatggtacaa aagcaaaata cttgtttggg 4140 
ggtacaccca ctcctctgcc tgtgtggttc ctgcagtcag tcctgcagac aggccctcag 4200 
tgggtcttcc atgggcaaca cgcagaggga ggcaatggat gggaataccc acaccctggt 4260 
^agtttaccc cggccatgct ctctgctctt catccctcct ctgccctctg ccacggcttt 432 0 
ctctgcagga atcatatctt catattggcc cacaggtgtt ctcctcaccc tagctatgat 4 3 80 
gtttacttta gagtgacctt agcagggctg gtgggaatga gttctagaag gctcacggag 4440 
atgctaggga agaaacgtct tctaactact gaggttacta agttcctggt ggttgtctct 4500 
gcctttccct tgttaaagtc accttgaagt tagtgcagaa gaaatcagag cccagtcaca 4560 
gagtaaatat ggtcctgaag att:tcctttg agtgcccaga atccatigaca tttcaagagc 4620 
cctctttgta ccttaagtca tttggggttg tatcttctgc ttgatgtatg tgtgtgtgtt 4680 
tatcaaagag tgagatggtt acataagagg tgctctaaag gacagagagg atttgcaatt 4 740 
gtggcatgtg acatcctcag gcctitgctict: ggtgccagga ggaactgatg cagaaaagag 4 800 
taagaggtca tttcctggag gctgtcacta tagaggagat cttacagtgc attccctcct 4860 
ccaggccctg cctgaggata gacatgtgct gactgcaact gaaacagagg cttgggatgg 4920 
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agagttaggt tcacagaagg gagggtggga gatggatgct tgctgggttc tgggtctcat 4980 
caccagctcc tgaccacccg gtcagcccat gtgcttattc catagct:t:t:c ttttgctatg 5040 
tttactcagt gtggtgtttg ttgggaccca gcagaagcca gtcccaggct gacagctgtg 5100 
gatacacagg gcagcatgag ggtcctcagc ctgaagcagt caggctggca gaagagaaag 5160 
accagcacac at:t:ccttcaa ccaactatgt cttgaaaaac aaacatatta tatcacatat 5220 
attgcattta tgagacagct aaaatgtact cgggtagcat gactccaggt ggggatatct 5280 
gcaagtgcca tgagtggcag agggacagcc aatgtgaggc aagaaggaat tctggctcaa 5340 
cacagcttag ctccctggtg ttggttcaaa ctttgagagt ttgaccacaa gcactttatt 54 00 
tttgacatat ttaaacagag cacaactttg ggaaaaagtt ttcttatgaa aattatcaca 5460 
ataaagctta aggcatgact acattaaaat gcctttgcaa agtatatgtg ccctcttcca 5520 
caagaatggt tctattgact gagaaataat gttcaggata aagatccagg aagaaaagat 5580 
cagggataag taaaatacta aactcttttg caaagtacat agaccctctt tcataacaat 5640 
gggttctatt gactgacaag cactgctcag gagttgggaa agagtctagc ataagcacga 5700 
tagcctggag actctagtga ggtctagtct tacagacagc aaaaatcacc aggttacaaa 5760 
ctacattcat tt:ccagt:ttt ctgatcaggc acaggtatga aticccttctg ttgaagagaa 5820 
aagtccatgt gtttaaaata tctggtttct ccagtgctat tagcgagaag acttgagccc 5880 
tatacaactc ccacctggag tgacatcctg tcttcatggt atattacata cctagacacg 594 0 
ctcatctcac agacttagga ctttgtcttc tgatctccat ttctgatccc acttccacct 6000 
ttgccttgat agtgtcattt tcttcactgc cttggtgaca accatgttat cctctgtgta 6060 
tttgagtgtt accattttca gattttacct gtatgcaaga tcacacagtc tttgtctttc 612 0 
tgtctggatg catgctaatc tctacacaac aacccttccc cgtcactcag atcttcctcc 6180 
attaacacat acatggtgct gaagaggcta gggagcttcc cttcagtggg gagctagctg 6240 
gctattgggc ct:ttttgact gtccaggaag gcccccaatt gctgagacaa gaacttagat 6300 
tcttcattat t:gact:ctaac tcatgtatca agcagaagct aatgaatagt tatcaacagg 6360 
atcagaggtt ccagtgtaag acactttgac atgaaagaac ggaggaagga cagatggatg 642 0 
cataaaagca ggaccactgc cccaggaagg tcctggaaac tgatgcaggg caaaggacag 6480 
gttataaacc aaatcttagg gagtcaggaa gagcacagag gagct:caacc aactgaccac 6540 
tgcttagggg ctaccaaccc aatcctccct gtgggaacag ctaagctatc agccaagggt 6600 
aataaacagg caggacctgt ggatigacatg gagagcatag ggaccctiggg tccagccttt 6660 
agcacctgca ctctcaggat actccaccat tgtgtcttag agagcctagg gatactgggt 6720 
ccagcctttg gtaccttcac tctcagggta ccccatcact gtgtcttgga gagcctaggc 6780 
accctgggtc cagccttcag tacctgcgct ctcaggacac cccaccattg tctcttgccc 684 0 
cgtctcttct tcctcttcct ccctttcatt gtctcttctc tgtttctttc ttgactctcc 6900 
tttcccctca caccctcact ctagttctcc ccttccctct ctgcatcacc ctattctctc 6960 
tgtggtccct ccactttcct ttatctctca tgcttctctc ctccctcaaa tacttgtcac 7020 
ccactatact tcaggggcca gctctagtga caaagctgtt aat:agcaaga ctctcagatc 7080 
tccaacggct cagaggagcc agacccacca agaactctct ccaggtccaa tttcaggttc 714 0 
cttcgaaagc ttitcagcaaa tgctcaggga acatgccact aacaagaaga tgcaaattcc 7200 
agttgagagt gggaaaggcc cttgcgtagg tcccatcttc caggccaagg tcagaggggc 7260 
tctgtgtaat ccggattgac agggctcaga acaatgtttt gtttttaagg tttatttatt 7320 
ttaggtgtta gtgtctttgc ttgcatgacc ttatgtgcat catgtgtgtg caggttcctg 73 80 
atgacagtag aggagggctt tgaatccctg gggataggaa gttacaggaa attataagct 744 0 
gctttgtggg tcttctagct ttcccaacag aagtgaatgc tcttcaccac tgagccatct 7500 
ctctaggccc aagagacatt gctttatgga t:ataatt:gt:g tgtgtgtgtc aacatcgagg 7560 
aaagggaaat aaaaaaaaaa cttcagccgc taaggttgta cagtttcact aattgctact 762 0 
tttagttgtg a^aaaatggc aggtgcttca acatttatat atacaaaaac ttccctgctg 7680 
gtggttcaac tgtgagaact ggggtaagtg ggtgagttct ctttttctgt ctctgtctct 774 0 
gtctctctcc ttccattctt tcttaaagga aataaacatt gcagctgggt tatagctcat 7800 
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caatatggaa gttacagaag tgaaaaaagg cattgccttg gtgggtggtg ttaccagctg 7860 
atttttggtt gtcctgcaag gaggtctggg gactggctgc tctgtctctg tctgtatgag 7920 
tgagggaagt ctggggagca gattccctaa ccttcagcct ggcctggttc ctgagtgaac 7980 
ccagcctctc tggtcctagt agctttttcc aaacaggaat ctgagtggtg acagggaaca 8040 
agtaccagcc cattgcttaa gtgccagggt tagtgagggc aggaagctgc catagctggg 8100 
attagtagtt gtattggatg taggaagtcc tatcctggga cagctaatcc ttaatgcttc 8160 
actggagatt ttcaatgaga aatttatccc acggcccata tggccccatc cttttgtctc 8220 
caacagccaa gtattttcca ttagaggaga cttcctgtac acttgatgga tgctcattcc 8280 
aaggtgactt ggggcagtca gtacagactt gggatgacct ctgacagcct aacctctccc 8340 
caacaagggc cctctatgtt tgctatgtaa tgtaatgtca gacattgtca ggagtgtccg 8400 
cagcacagcc tgcccagtgt gagggctctc ataggtttcc cactgtctta tctacacagg 84 60 
gataacgagg aggtaagctg cagttcccag tctcacttca cagaggaaga gataacccca 8520 
tcccaggtca tgtagccagc agtggaaaga atgaggattt gaactcaggt cttccaagtc 8580 
ccattgatag catctcctca caagtccctt gccaccctca cgatgcctta gacacttgcc 864 0 
tgccctttat actaaggaga tgcaggtaca aggggtttac ccatgtagca gctgaggcag 87 0 0 
ctggggatag ataccagcag caggcctgat gtcaccactc taactccagc atccccagtc 8760 
tgtgttcctg gagtgtgaaa atccctactt aacaagattg tgcaacagtc cttggctctg 8820 
tgacccatag ctggaaacag gattctcatt gatttgtgga acatggtggc agccagccaa 8880 
aaagagggtc tgcatacaga agacacgtgt ggcaaggcca cagcagactc tgactacctt 8940 
agcttacaga attacaaggt cataacgtcc tctgctttgg tcacctcatg ttaaggacag 9000 
gccctaatga agatggggca gaagactgaa ggaatggcca accaataact ggcccaactt 9060 
gagacccatc ctacaggcaa gcatcaattc ctgacactac taatgatact ctgttatgct 9120 
tgcagacaga agcctagcat aact:at:cctc cgagaggtcc acccagcaac tgactgaaac 9180 
agaaaaagat atccacaggc aaacagtgga tggaggtcag ggactattat gggagagctg 9240 
tgggaaggat taaaaaccct gaaggggata ggaaccccac aggaagacca acagagtcaa 9300 
ctaagagacc tgtgggagct ctcagagact gagccaccaa ccaaagagca tacacaggcc 9360 
ggtccgaggc acctggcacg tgtgaagcag acatgcagct cagtctccat gtaggtcctc 942 0 
caataagcgg tagcctgact gcagtatcca at:ccccaaca gggctgcata gtctggcctc 9480 
agtgggggag gatgccccta atcctgcaga gacttgatga gtggagagct atccaggggg 9540 
aacccaccct ctctgagaag ggaatgggga tgggggaggg actctgtgaa gaggggacaa 9600 
ggacaaacaa gaacctcaaa taggtcaggc cctaaaggct tgctaagnag cagtggccca 9660 
gctctgtcct gttcctcagc ccaaggctca gctcccacct gtttctgtgt ttttctggct 9720 
tttcatgggc ctaggacttg gtgaccagtt caaacaatgg ggcctgtgga agacacaata 97 8 0 
tacaagacta gggacattcc tgttctgctg actatccata gcctgatgta ggtggaagga 9840 
cccaatcact ggatttctac cctt:gcacaa ccttgacagc t:gagggcctc tcagaaacct 9900 
atttcttcca ctgaaaaatg agactctcaa atgaacgtcg tgacaatcat caggcttatt 9960 
aaagaggtgt atctaacctg aatggcaagc agacagcagg caaatgtctg tatcaacctc 1002 0 
taggaaggac aagaactgct cactgctgcc ccccaggagg ccatttgctg aaacagctgc 10080 
tctcctgctg gtgcacaggc cctgccttct cattgcagcc acagcccctt cctgtctgaa 1014 0 
cctcctgtca ggtcactggg aaacagatca agatggaaca ggacagctcc tgatggtaaa 10200 
taaaaaacag tggtcatggc tatl:catagg ggtttatgct tcttcagtcc acactgtgaa 10260 
gagctgtggg catgaaccac agtgttcgag gtagagttgg ggttctgaaa ttcacagtgg 1032 0 
ggtgagctca gtaaatgtga gctggaggtc actcgtgaga cacacagtcc tgctgcttct 10380 
gtccccaata tcctgaggag acgacacatc tactttgttc agaggccaca gtctagttga 10440 
cctgagagtt accagtttct tatttgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 10500 
tgttgtt:cgt gtgtgagtgc aggtgcacat atgatagcgt acacgt^gag gtcagaggat 10560 
aactatcagg cgttgtcccc tcctactttt cctcggactc tggagaacaa acatgggtcc 10620 
ttattccagg ggagcaagtc gctgttggct gacacatctt gctcacatac attttaccta 10680 
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gacaatggag cctccatcag agtattactt tagctcctca ccgatggcaa Cgcaccacct 10740 
ctctacccac ataggagttg ggtctccaca cacccccaca cccccttcac caaaacgttt 10800 
tcagttactt tatctggtaa agttcatcag agaatgaagc cagtattaag aacatggaat 10860 
catttgggaa cctggatcta gcaatacccc accctagatg gagttgctga gttttcacct 10920 
cagattataa ttccccccta gcttctatgg tttattctga aaccagggga actcgattcc 10980 
tccctttgga ccacagacat cctggcttgt gaattcacat gtcatctact gctaatccat 11040 
tggtagtatg tggctcacag agacacacta cagtcatggc caatgtcaag gtaggacaga 11100 
tgtgaatcat tcccccagtc ctgctgtttt catgactaac cctcctcagc acagtgacca 11160 
tgaacctact tttcccctcc ttttattttt agaattgctg gaattttcta ttttgagaaa 1122 0 
t.aatagcctt gggcagcatt aaacaaaatc atctagaaag ctggtttaaa atacagatgg 11280 
ttgagtcagt gaaagagtga ggaatgtcat tattggcccc tcacagaggc tggctcactc 11340 
cagcagaggt ggttgaagct cttggacacg ggtcaggtgc ataggaaagg tngtctggga 11400 
cactgagaac cacaattgaa caaacagaac tgttggcttt ttttttttta aatgagttct 11460 
caaaaaatga ctggctagct taggcaaata cttcgagcca acccaacaga acattcttcc 11520 
attgattcat tctggatctt ctttctagac aatactgaac tgaccccttg ttggcagtct 11580 
caagtttgac aacatagggc tttgaacttg gcacaaggtc catcactgtc acccaagcat 11640 
cctgggtgac ctttgggttg gaatatcttg gctaacctta gatattttct ttggagtatc 11700 
tttagaacat ccaggaaata gggcttgatt ctcatcctgg gaccacaata taagtcaccc 11760 
tiagaatccca ggagatcgtg cagagaaaca aggatctctc tcgtgtgcat ccttcttcaa 11820 
agcagtgagt agtgactcca ctaaactgag ttcccatctg agagtccaca ggaggctttg 11880 
gggcaagaag cagagggaag gcactgtttg tgttggtaaa gttttgactc taacaaattt 11940 
gaagacatag atgacattgt gtcagactaa caacaaccta gactcatgtg ggttctgttt 12000 
agggatcaga ttttattcat caatgacttg tcttagtgta tagagaaagg cttcctactg 12060 
gagtgtaggc tcaataatga cagaagagat agctatttcc cctagggact gtgctgctcc 12120 
aagtttggtg gagaaaggca gtggggaacc tagatgtgct ctctggggag ggggtctgaa 12180 
gctggcttca tagaaggtgt gaagttttgc tgaaacatct aaacagaat:t at:agctt:agg 12240 
aaagtgagca ggcaaggcag ggaatgtgtt gcatatgtat atgtacatga atatattatg 12300 
ttatagatac acacacattt gaacctcatt tgcagatgac agaaaatzagg ttattttgcc 12360 
tctcttaact gctaagcaca atgacttcca gttccatcca tttcctgaaa tgccacaatt 12420 
tcatttttca ttgtggctga ataaaattcc attgcagact gggccctact tcatccactc 12480 
ctgagggcag gcatatcccc tggctccatt tcttacctat tgtgaagaga agtgcaactg 1254 0 
t:cttgtt:gaa aggcaagcgt gagagaggca ggcactaatt gtgggttt:t:t: gtttcttctt 12600 
cctgctatga ctctccattt gtcagaacca aagatcgata aaagccgcca ccatgaaagc 12660 
catcttaatc ccatttttat ctcttctgat tccgttaacc ccgcaatctg cattcgctca 12720 
gagtgagccg gagctgaagc tggaaagtgt ggtgattgtc agtcgtcatg gtgtgcgtgc 12780 
tccaaccaag gccacgcaac tgatgcagga tgtcacccca gacgcatggc caacctggcc 12840 
ggtaaaactg ggttggctga caccgcgcgg tggtgagcta atcgcctatc tcggacatta 12900 
ccaacgccag cgtctiggtag ccgacggatt gctggcgaaa aagggctgcc cgcagtctgg 12960 
tcaggtcgcg attattgctg atgtcgacga gcgtacccgt aaaacaggcg aagccttcgc 13 02 0 
cgccgggctg gcacctgact gtgcaataac cgtacatacc caggcagata cgtccagtcc 13 080 
cgatccgtta tttaatcctc taaaaactgg cgtttgccaa ctggataacg cgaacgtgac 13140 
tgacgcgatc ctcagcaggg caggagggtc aattgctgac tttaccgggc aticggcaaac 13200 
ggcgtttcgc gaactggaac gggtgcttaa ttttccgcaa tcaaacttgt gccttaaacg 13260 
tgagaaacag gacgaaagct gttcattaac gcaggcatta ccatcggaac tcaaggtigag 13320 
cgccgacaat gtctcattaa ccggtgcggt aagcctcgca tcaatgctga cggagatatt 13380 
tctcctgcaa caagcacagg gaatgccgga gccggggtgg ggaaggatca ccgattcaca 13440 
ccagtggaac accttgctaa gtttgcataa cgcgcaattt tatttgct:ac aacgcacgcc 13500 
agaggttgcc cgcagccgcg ccaccccgtt attagatttg atcaagacag cgttgacgcc 13560 
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ccatccaccg caaaaacagg cgtatggtgt 
cggacacgat actaatctgg caaatctcgg 
cggtcagccg gataacacgc cgccaggtgg 
aagcgat:aac agccagtgga ttcaggtttc 
tgataaaacg ccgctgtcat taaatacgcc 
atgtgaagag cgaaatgcgc agggcatgtg 
tgaagcacgc atacccgctt gcagtttgta 
aagaggaaga acagaaggat gccacaactc 
ttacttctga tggcatttcc ctctagaaag 
gaccacccaa aggaccctcc caaactctct 
caccatccca gaattaaaat cctaactgca 
aataagagtt gttggcagtg ccaggcgtgg 
aggcagaggc aggcggattt ctgagttcga 
gacagccagg gctatacaga gaaaccctgt 
gttggcagag tgtgggttat ataccaggtg 
ccagaaggaa cttagaggat agctcataac 
attgagagag tgggcacaca gccactgtgt 
tacatgcata agtgtatatt ggcgccatcc 
cggggttagg tggccatggc ctttcctgcc 
tatgctctct taactcttcc attgctactt 
ccttgggtac atcagtgatc ctggtgatat 
gaggct:gcaa ctaaagaggt cttct:t:aata 
agaagttcac agaggtgaag tgattcatgt 
ggattatctg actctactct aacttttatg 
ttcctgtgct tcagctctgg gagactccca 
gactctgaca ctctgcattg attaattagc 
ttgtttcact ttccatatag gctatgaagg 
gaggcaatcc acctctctca ggaagcctct 
aactgtaggc ccagtccttg gtgtccaaaa 
tccatgtgct caaaggtttg aacatggagc 
ttgagactgg atgctctttg gtcccatgtt 
ggcat:gct:ac cagctaccac agactatgcc 
tagacttgta tctcctaaaa atggaatcaa 
tttctgttaa gtgtttggtc acagggacaa 
gagttgaggt tcattgctct agcaagttgg 
ataagagaca tgtagaagag tctgaagctg 
aatagtttaa tacaccatgg gaattgtgaa 
aaaacgtgag catgtggcgt gtgagagggc 
gaagccattc ggctacgtta gggaacgtgt 
ctgaatgagg ccaaatttta aaggagtgga 
cagaccacca ctcaggctat gccgtgtttg 
ttgtgaaatt ccagagcaat tatcagagca 
ggtgtgggt:c cctaagtgga tggtgcataa 
gataatccaa aatatcagca atgtggaatg 
tagaactttg ctcatggctg taataaatag 
gtctgagtta cggttccagg gcaaacattc 
agccaaaggt cagctggtca cattgcatca 
aggat:acagg ttataaaacc tcactgtcca 
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gacattaccc act:tcagtgc tgtttatcgc 13620 
cggcgcactg gagctcaact ggacgcttcc 13680 
tgaactggtg tttgaacgct ggcgtcggct 13740 
gctggtcttc cagactttac agcagatgcg 13800 
gcccggagag gtgaaactga ccctggcagg 13860 
ttcgttggca ggttttacgc aaatcgtgaa 13920 
aggtacccgg ggatcacaac ttgccctctg 13380 
tcctgctggc tactctccag tggtttcatc 14040 
tgctactatc atccacacat ttctacctga 14100 
tcctctctga gtagtctcca cacctgttac 14160 
ctctggcgtg tgacttgcct cagtccttgc 14220 
tggcgcacgc ctttaattcc agcacttggg 14280 
ggccagcctg gtctacagag tgagttccag 14340 
gtcgaaaaac caaaaaaaaa aaaaaaagtt 14400 
gagatttcaa atgagtggct gaagctgtag 14460 
tt:aaaaagaa atgtagagag tagcagaaac 1452 0 
gaat:gt:ggca gaacacaat:c cagccagcta 14580 
tgactgatga gacacaggaa aacagataga 14640 
tgcctcttcc taagggtcat ctcaagacct: 14700 
agcttctaga tatcacctcc agattagtct 14760 
ccagggcttc ctgattccat ctttgtcata 14 82 0 
cttcacaccc tga^gccaaa aggaagacac 14 880 
aggacataca gtgagcaagc atcagggtcc 14940 
taaatgtgct ttatgccatt aacactgtca 15000 
agcactctta ggcacaagcc acaattaagg 15060 
atggtggtct ctatgtttcc agattcatga 15120 
gtgtgaggaa attttttggg gacagaattg 15180 
atctggaaaa gctt:acaact cagggacagt: 15240 
tgggttttat ggtttgaatc tgcaaagcct 15300 
ctcctcctgg taacactgta ttggaggctt 15360 
ttgcCacatc atctgtcaag atatgaccca 15420 
tctccagctt tcatgttctc cccaccatga 154 80 
agcaaacttt tcctgcatta agtttttttt 15540 
gaaaacactc aat:acagata attagtacca 156 00 
atcaaatttt tagggcttitg gaact:gattt: 15660 
tgggctacag aagtgtcacc agtttttaag 157 2 0 
aatcagaatg ctcacacaaa ggcagacagg 15780 
ataagaagga acct:aggggg aaatgagcta 1584 0 
gtggctgtgc ttggcccatg ccctggcaat 15900 
ctaactcgat tgticagagaa aatiatcaaga 15960 
tgaccgacca gctactctta gccagctcta 16020 
t:gaagataca tacagtttag tgaagtaagg 16080 
atct:atgtag gtgatgccta agtgacactt 1614 0 
ticttccaagg agacctgtag acacacattt 16200 
ctagctagaa atcatttcct gaagaggtta 16260 
agtgatggca aggaaggcat tgcagtcagg 16320 
agagtagaga gtcagagtgt gagtagaaag 16380 
ctctcagcaa tccattttct cctaaaaggc 16440 
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tttaccttct aaagatttta gtcttcaaaa ccagtaccag tagcctggga acaaaagttg 16500 

aaacaaatga gcctttgtgg ggcatttcac acttaaaaca gggcatcacc taggaggagc 16560 

cctgtgtgca gtaggaagtg tggcctctgt gtcaggaatg ctcaggctaa taaggggtcc 1662 0 

tctatctgag ggaccctatg aagattcaac aagtagttgt gagaattccc tigtaaatgga 16680 

tgctaccaat ttgacatttg tagacctgct attgtgtgct tctttattgg gctctcccat 16740 

ctcccaactt tccaacccat: attccacatt aatcccttcc accaccatgc aacactaggt 16800 

aggagagaag gaaggttaga agagaaagtg ggtatagatc tatttagact acttcctgct 16860 

gactaggggc aagtccaatc gtcattgtca ggatacctcc aaccagcaac cagcaaacca 16920 

gcaaatcaga aacagcaaaa gcagccaaca aggcagcact aaccagcagg antggggtcg 16980 

gtagcgtggg agcagtcact actggtcttc tcatggcttt ggcattaata ctctctcaag 1704 0 

aaattccgta attttttccc caccacctga aattccgtaa ttttaaatgc aaactatcta 17100 

cagctggcaa aaatcaca^c tctcctagag cacaagacaa atcatagtta ctggctattt 17160 

gcaatctgaa gcatctcaat atcccacacc tgggattaaa acaaaaacat at^cacaCca 17220 

cataactgtt ttttttttcc aattttttat taggtatttt ctttatttac atititcaaatg 17280 

ctatcccgaa agtcccctat accctcccac ctccctgctc ccctacacac ccactcccac 17340 

tttttgaccc tggagttccc cggtactggg gcatataaag tttgcaagac caaggggcct 17400 

ctcttcccag tgatggccga ctaagccatc ttctgctaca tatgcagata gagacacgag 17460 

ctctgggggt actagttagt tcatattgtt gttccaccta tagggtcgca gaccccttca 17520 

gctccttggg tactttgtct agctcctcca ctgggggctc tgtgttttat ctaatagatg 17580 

actgtgagca tccacttctg tatttgacag gcactggcct agcgtcacat gagccagcta 17 64 0 

tatcagggtc ctttcagcaa aaccttgctg gcatgtgcaa tagtgtctgc gtttggtggt 17700 

tgattatggg atggatccac tagttctaga gc 17732 
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